突发好奇，比较了python和R里面一些包自带的function，发现差别好大

LTkongjianyang

首先是看了下python下pandas下的function

import pandas
print(dir(pandas))

发现还好，大概144个

['ArrowDtype', 'BooleanDtype', 'Categorical', 'CategoricalDtype', 'CategoricalIndex', 'DataFrame', 'DateOffset', 'DatetimeIndex', 'DatetimeTZDtype', 'ExcelFile', 'ExcelWriter', 'Flags', 'Float32Dtype', 'Float64Dtype', 'Float64Index', 'Grouper', 'HDFStore', 'Index', 'IndexSlice', 'Int16Dtype', 'Int32Dtype', 'Int64Dtype', 'Int64Index', 'Int8Dtype', 'Interval', 'IntervalDtype', 'IntervalIndex', 'MultiIndex', 'NA', 'NaT', 'NamedAgg', 'Period', 'PeriodDtype', 'PeriodIndex', 'RangeIndex', 'Series', 'SparseDtype', 'StringDtype', 'Timedelta', 'TimedeltaIndex', 'Timestamp', 'UInt16Dtype', 'UInt32Dtype', 'UInt64Dtype', 'UInt64Index', 'UInt8Dtype', '__all__', '__builtins__', '__cached__', '__deprecated_num_index_names', '__dir__', '__doc__', '__docformat__', '__file__', '__getattr__', '__git_version__', '__loader__', '__name__', '__package__', '__path__', '__spec__', '__version__', '_config', '_is_numpy_dev', '_libs', '_testing', '_typing', '_version', 'annotations', 'api', 'array', 'arrays', 'bdate_range', 'compat', 'concat', 'core', 'crosstab', 'cut', 'date_range', 'describe_option', 'errors', 'eval', 'factorize', 'from_dummies', 'get_dummies', 'get_option', 'infer_freq', 'interval_range', 'io', 'isna', 'isnull', 'json_normalize', 'lreshape', 'melt', 'merge', 'merge_asof', 'merge_ordered', 'notna', 'notnull', 'offsets', 'option_context', 'options', 'pandas', 'period_range', 'pivot', 'pivot_table', 'plotting', 'qcut', 'read_clipboard', 'read_csv', 'read_excel', 'read_feather', 'read_fwf', 'read_gbq', 'read_hdf', 'read_html', 'read_json', 'read_orc', 'read_parquet', 'read_pickle', 'read_sas', 'read_spss', 'read_sql', 'read_sql_query', 'read_sql_table', 'read_stata', 'read_table', 'read_xml', 'reset_option', 'set_eng_float_format', 'set_option', 'show_versions', 'test', 'testing', 'timedelta_range', 'to_datetime', 'to_numeric', 'to_pickle', 'to_timedelta', 'tseries', 'unique', 'util', 'value_counts', 'wide_to_long']

然后看了下R下的dplyr包的functions

library(dplyr)
ls("package:dplyr")

发现居然有297个，接近300个，其中大部分平时都没有用过，其中不乏有这些

[255] "summarise_"            "summarise_all"        
[257] "summarise_at"          "summarise_each"       
[259] "summarise_each_"       "summarise_if"         
[261] "summarize"             "summarize_"           
[263] "summarize_all"         "summarize_at"         
[265] "summarize_each"        "summarize_each_"      
[267] "summarize_if"

这样咋能记得住...

Jonie_Y

LTkongjianyang 这是个好问题，R中的方法，有一些是因为方法分派的问题，就是即使你不记也没关系，它会根据类属性，自动进行方法分派。显得函数多。当然，也有的是相近函数，不同作用。不过吧，学编程不就是个记函数和学语法的过程嘛~~

yydhcl

LTkongjianyang 那你试下 len(dir(pandas.DataFrame))

LTkongjianyang

Jonie_Y 哈哈，说的很有道理，就是有时候会想着把所有函数都过一遍，但是感觉除了常见的一些，其他的都不太能做到...

Jonie_Y

LTkongjianyang 看一遍是好的，即使记不住，起码知道，都有哪些能做到。查也可以知道个方向。免得大海捞针~~

LTkongjianyang

yydhcl 有四百多个，好吓人，根本记不住 ...