Python数据科学:NumPy Cheat Sheet

简介: Python数据科学:NumPy Cheat Sheet

Key and Imports

In this cheat sheet, we use the following shorthand:


arr | A NumPy Array object


You’ll also need to import numpy to get started:


import numpy as np


Importing/exporting

np.loadtxt(‘file.txt’) | From a text file

np.genfromtxt(‘file.csv’,delimiter=’,’) | From a CSV file

np.savetxt(‘file.txt’,arr,delimiter=’ ‘) | Writes to a text file

np.savetxt(‘file.csv’,arr,delimiter=’,’) | Writes to a CSV file


Creating Arrays

np.array([1,2,3]) | One dimensional array

np.array([(1,2,3),(4,5,6)]) | Two dimensional array

np.zeros(3) | 1D array of length 3 all values 0

np.ones((3,4)) | 3x4 array with all values 1

np.eye(5) | 5x5 array of 0 with 1 on diagonal (Identity matrix)

np.linspace(0,100,6) | Array of 6 evenly divided values from 0 to 100

np.arange(0,10,3) | Array of values from 0 to less than 10 with step 3 (eg [0,3,6,9])

np.full((2,3),8) | 2x3 array with all values 8

np.random.rand(4,5) | 4x5 array of random floats between 0-1

np.random.rand(6,7)*100 | 6x7 array of random floats between 0-100

np.random.randint(5,size=(2,3)) | 2x3 array with random ints between 0-4


Inspecting Properties

arr.size | Returns number of elements in arr

arr.shape | Returns dimensions of arr (rows,columns)

arr.dtype | Returns type of elements in arr

arr.astype(dtype) | Convert arr elements to type dtype

arr.tolist() | Convert arr to a Python list

np.info(np.eye) | View documentation for np.eye


Copying/sorting/reshaping

np.copy(arr) | Copies arr to new memory

arr.view(dtype) | Creates view of arr elements with type dtype

arr.sort() | Sorts arr

arr.sort(axis=0) | Sorts specific axis of arr

two_d_arr.flatten() | Flattens 2D array two_d_arr to 1D

arr.T | Transposes arr (rows become columns and vice versa)

arr.reshape(3,4) | Reshapes arr to 3 rows, 4 columns without changing data

arr.resize((5,6)) | Changes arr shape to 5x6 and fills new values with 0


Adding/removing Elements

np.append(arr,values) | Appends values to end of arr

np.insert(arr,2,values) | Inserts values into arr before index 2

np.delete(arr,3,axis=0) | Deletes row on index 3 of arr

np.delete(arr,4,axis=1) | Deletes column on index 4 of arr


Combining/splitting

np.concatenate((arr1,arr2),axis=0) | Adds arr2 as rows to the end of arr1

np.concatenate((arr1,arr2),axis=1) | Adds arr2 as columns to end of arr1

np.split(arr,3) | Splits arr into 3 sub-arrays

np.hsplit(arr,5) | Splits arr horizontally on the 5th index


Indexing/slicing/subsetting

arr[5] | Returns the element at index 5

arr[2,5] | Returns the 2D array element on index [2][5]

arr[1]=4 | Assigns array element on index 1 the value 4

arr[1,3]=10 | Assigns array element on index [1][3] the value 10

arr[0:3] | Returns the elements at indices 0,1,2 (On a 2D array: returns rows 0,1,2)

arr[0:3,4] | Returns the elements on rows 0,1,2 at column 4

arr[:2] | Returns the elements at indices 0,1 (On a 2D array: returns rows 0,1)

arr[:,1] | Returns the elements at index 1 on all rows

arr<5 | Returns an array with boolean values

(arr1<3) & (arr2>5) | Returns an array with boolean values

~arr | Inverts a boolean array

arr[arr<5] | Returns array elements smaller than 5


Scalar Math

np.add(arr,1) | Add 1 to each array element

np.subtract(arr,2) | Subtract 2 from each array element

np.multiply(arr,3) | Multiply each array element by 3

np.divide(arr,4) | Divide each array element by 4 (returns np.nan for division by zero)

np.power(arr,5) | Raise each array element to the 5th power


Vector Math

np.add(arr1,arr2) | Elementwise add arr2 to arr1

np.subtract(arr1,arr2) | Elementwise subtract arr2 from arr1

np.multiply(arr1,arr2) | Elementwise multiply arr1 by arr2

np.divide(arr1,arr2) | Elementwise divide arr1 by arr2

np.power(arr1,arr2) | Elementwise raise arr1 raised to the power of arr2

np.array_equal(arr1,arr2) | Returns True if the arrays have the same elements and shape

np.sqrt(arr) | Square root of each element in the array

np.sin(arr) | Sine of each element in the array

np.log(arr) | Natural log of each element in the array

np.abs(arr) | Absolute value of each element in the array

np.ceil(arr) | Rounds up to the nearest int

np.floor(arr) | Rounds down to the nearest int

np.round(arr) | Rounds to the nearest int


Statistics

np.mean(arr,axis=0) | Returns mean along specific axis

arr.sum() | Returns sum of arr

arr.min() | Returns minimum value of arr

arr.max(axis=0) | Returns maximum value of specific axis

np.var(arr) | Returns the variance of array

np.std(arr,axis=1) | Returns the standard deviation of specific axis

arr.corrcoef() | Returns correlation coefficient of array

相关文章
|
1天前
|
存储 机器学习/深度学习 数据处理
NumPy:从初识到实战,探索Python科学计算的无限可能
NumPy:从初识到实战,探索Python科学计算的无限可能
38 0
|
1天前
|
SQL 数据采集 数据挖掘
构建高效的Python数据处理流水线:使用Pandas和NumPy优化数据分析任务
在数据科学和分析领域,Python一直是最受欢迎的编程语言之一。本文将介绍如何通过使用Pandas和NumPy库构建高效的数据处理流水线,从而加速数据分析任务的执行。我们将讨论如何优化数据加载、清洗、转换和分析的过程,以及如何利用这些库中的强大功能来提高代码的性能和可维护性。
|
1天前
|
存储 测试技术 程序员
【Python】—— NumPy基础及取值操作
【Python】—— NumPy基础及取值操作
8 1
|
1天前
|
索引 Python
【Python】—— Numpy 初体验
【Python】—— Numpy 初体验
4 0
|
1天前
|
机器学习/深度学习 TensorFlow 算法框架/工具
Python深度学习基于Tensorflow(1)Numpy基础
Python深度学习基于Tensorflow(1)Numpy基础
23 5
|
1天前
|
数据采集 Web App开发 数据可视化
Python爬虫技术与数据可视化:Numpy、pandas、Matplotlib的黄金组合
Python爬虫技术与数据可视化:Numpy、pandas、Matplotlib的黄金组合
|
1天前
|
数据挖掘 数据处理 索引
Python数据分析面试:NumPy基础与应用
【4月更文挑战第16天】了解并熟练运用NumPy是衡量Python数据分析能力的关键。本文探讨了面试中常遇到的NumPy问题,包括数组创建、属性、索引切片、数组运算、统计函数以及重塑和拼接,并提供了相关代码示例。同时强调了易错点,如混淆Python列表与NumPy数组、误解广播规则等,提醒在数据处理中注意性能和内存效率。掌握这些知识点将有助于提升数据分析面试表现和技能。
33 5
|
1天前
|
机器学习/深度学习 数据可视化 数据挖掘
《Python 简易速速上手小册》第9章:数据科学和机器学习入门(2024 最新版)
《Python 简易速速上手小册》第9章:数据科学和机器学习入门(2024 最新版)
25 1
|
1天前
|
存储 数据挖掘 数据处理
Python中数据分析工具NumPy
【4月更文挑战第14天】NumPy是Python的数值计算扩展库,专注于处理大型多维数组和矩阵。其主要特性包括ndarray对象(存储同类型元素的高效数据容器)、广播机制(处理不同形状数组运算)、线性代数与统计函数、随机数生成及与C/C++/Fortran代码集成。作为Pandas等数据分析工具的基础,掌握NumPy对提升数据处理能力至关重要。
9 1
|
1天前
|
Python
用Python的Numpy求解线性方程组
用Python的Numpy求解线性方程组