| 859 | 859 |
"cell_type": "markdown",
|
| 860 | 860 |
"metadata": {},
|
| 861 | 861 |
"source": [
|
| 862 | |
"运算|函数|\n",
|
| 863 | |
"--- | --- \n",
|
|
862 |
"运算符|函数|\n",
|
|
863 |
":---: | :---: \n",
|
| 864 | 864 |
"`==` | `equal`\n",
|
| 865 | 865 |
"`!=` | `not_equal`\n",
|
| 866 | 866 |
"`>` | `greater`\n",
|
| 867 | 867 |
"`>=` | `greater_equal`\n",
|
| 868 | 868 |
"`<` | `less`\n",
|
| 869 | 869 |
"`<=` | `less_equal`\n",
|
| 870 | |
"`^` | `bitwise_xor`\n",
|
| 871 | |
"`~` | `invert`\n",
|
| 872 | |
"`>>` | `right_shift`\n",
|
| 873 | |
"`<<` | `left_shift`\n",
|
| 874 | 870 |
"\n",
|
| 875 | 871 |
"数组元素的比对,我们可以直接使用运算符进行比较,比如判断数组中元素是否大于某个数:"
|
| 876 | 872 |
]
|
|
| 936 | 932 |
"cell_type": "markdown",
|
| 937 | 933 |
"metadata": {},
|
| 938 | 934 |
"source": [
|
| 939 | |
"比如判断数组某个区间的元素是否存在大于 `20`:"
|
|
935 |
"比如判断数组某个区间的元素是否存在大于 `20`的元素:"
|
| 940 | 936 |
]
|
| 941 | 937 |
},
|
| 942 | 938 |
{
|
|
| 959 | 955 |
"cell_type": "markdown",
|
| 960 | 956 |
"metadata": {},
|
| 961 | 957 |
"source": [
|
| 962 | |
"读写各种格式的文件,如下表所示:\n",
|
| 963 | |
"\n",
|
| 964 | |
"文件格式|使用的包|函数\n",
|
| 965 | |
"----|----|----\n",
|
| 966 | |
"txt | numpy | loadtxt, genfromtxt, fromfile, savetxt, tofile\n",
|
| 967 | |
"csv | csv | reader, writer\n",
|
| 968 | |
"Matlab | scipy.io | loadmat, savemat\n",
|
| 969 | |
"hdf | pytables, h5py| \n",
|
| 970 | |
"NetCDF | netCDF4, scipy.io.netcdf | netCDF4.Dataset, scipy.io.netcdf.netcdf_file\n",
|
| 971 | |
"**文件格式**|**使用的包**|**备注**\n",
|
| 972 | |
"wav | scipy.io.wavfile | 音频文件\n",
|
| 973 | |
"jpeg,png,...| PIL, scipy.misc.pilutil | 图像文件\n",
|
| 974 | |
"fits | pyfits | 天文图像\n"
|
| 975 | |
]
|
| 976 | |
},
|
| 977 | |
{
|
| 978 | |
"cell_type": "markdown",
|
| 979 | |
"metadata": {},
|
| 980 | |
"source": [
|
| 981 | 958 |
"`savetxt` 可以将数组写入文件,默认使用科学计数法的形式保存:"
|
| 982 | 959 |
]
|
| 983 | 960 |
},
|
| 984 | 961 |
{
|
| 985 | 962 |
"cell_type": "code",
|
| 986 | |
"execution_count": null,
|
|
963 |
"execution_count": 2,
|
| 987 | 964 |
"metadata": {},
|
| 988 | 965 |
"outputs": [],
|
| 989 | 966 |
"source": [
|
|
| 1533 | 1510 |
"<img src=\"http://imgbed.momodel.cn/scikitlearn.png\" width=300 />\n",
|
| 1534 | 1511 |
"\n",
|
| 1535 | 1512 |
"+ **Python** 语言的机器学习工具\n",
|
| 1536 | |
"+ `Scikit-learn` 包括许多知名的机器学习算法的实现\n",
|
| 1537 | |
"+ `Scikit-learn` 文档完善,容易上手,丰富的 `API`"
|
|
1513 |
"+ `Scikit-learn` 包括大量常用的机器学习算法\n",
|
|
1514 |
"+ `Scikit-learn` 文档完善,容易上手"
|
| 1538 | 1515 |
]
|
| 1539 | 1516 |
},
|
| 1540 | 1517 |
{
|
|
| 1802 | 1779 |
"\n",
|
| 1803 | 1780 |
"划分比例:\n",
|
| 1804 | 1781 |
"+ 训练集:70% 80% 75%\n",
|
| 1805 | |
"+ 测试集:30% 20% 30%\n",
|
|
1782 |
"+ 测试集:30% 20% 25%\n",
|
| 1806 | 1783 |
"\n",
|
| 1807 | 1784 |
"<br>\n",
|
| 1808 | |
"`sklearn.model_selection.train_test_split(arrays, *options)`\n",
|
|
1785 |
"`sklearn.model_selection.train_test_split(x, y, test_size, random_state )`\n",
|
| 1809 | 1786 |
" + `x`:数据集的特征值\n",
|
| 1810 | 1787 |
" + `y`: 数据集的标签值\n",
|
| 1811 | 1788 |
" + `test_size`: 如果是浮点数,表示测试集样本占比;如果是整数,表示测试集样本的数量。\n",
|