How do i print a column length in python?

This article describes how to get the number of rows, columns, and total number of elements [size] of pandas.DataFrame and pandas.Series.

  • pandas.DataFrame
    • Display number of rows, columns, etc.: df.info[]
    • Get the number of rows: len[df]
    • Get the number of columns: len[df.columns]
    • Get the number of rows and columns: df.shape
    • Get the number of elements: df.size
    • Notes when specifying index
  • pandas.Series
    • Get the number of elements: len[s], s.size

As an example, use Titanic survivor data. It can be downloaded from Kaggle.

import pandas as pd

df = pd.read_csv['data/src/titanic_train.csv']

print[df.head[]]
#    PassengerId  Survived  Pclass  \
# 0            1         0       3   
# 1            2         1       1   
# 2            3         1       3   
# 3            4         1       1   
# 4            5         0       3   
# 
#                                                 Name     Sex   Age  SibSp  \
# 0                            Braund, Mr. Owen Harris    male  22.0      1   
# 1  Cumings, Mrs. John Bradley [Florence Briggs Th...  female  38.0      1   
# 2                             Heikkinen, Miss. Laina  female  26.0      0   
# 3       Futrelle, Mrs. Jacques Heath [Lily May Peel]  female  35.0      1   
# 4                           Allen, Mr. William Henry    male  35.0      0   
# 
#    Parch            Ticket     Fare Cabin Embarked  
# 0      0         A/5 21171   7.2500   NaN        S  
# 1      0          PC 17599  71.2833   C85        C  
# 2      0  STON/O2. 3101282   7.9250   NaN        S  
# 3      0            113803  53.1000  C123        S  
# 4      0            373450   8.0500   NaN        S  

Get the number of rows, columns, elements of pandas.DataFrame

Display number of rows, columns, etc.: df.info[]

The info[] method of pandas.DataFrame can display information such as the number of rows and columns, the total memory usage, the data type of each column, and the number of non-NaN elements.

df.info[]
# 
# RangeIndex: 891 entries, 0 to 890
# Data columns [total 12 columns]:
# PassengerId    891 non-null int64
# Survived       891 non-null int64
# Pclass         891 non-null int64
# Name           891 non-null object
# Sex            891 non-null object
# Age            714 non-null float64
# SibSp          891 non-null int64
# Parch          891 non-null int64
# Ticket         891 non-null object
# Fare           891 non-null float64
# Cabin          204 non-null object
# Embarked       889 non-null object
# dtypes: float64[2], int64[5], object[5]
# memory usage: 83.6+ KB

The result is standard output and cannot be obtained as a value.

Get the number of rows: len[df]

The number of rows of pandas.DataFrame can be obtained with the Python built-in function len[].

In the example, it is displayed using print[], but len[] returns an integer value, so it can be assigned to another variable or used for calculation.

Get the number of columns: len[df.columns]

The number of columns of pandas.DataFrame can be obtained by applying len[] to the columns attribute.

print[len[df.columns]]
# 12

Get the number of rows and columns: df.shape

The shape attribute of pandas.DataFrame stores the number of rows and columns as a tuple [number of rows, number of columns].

print[df.shape]
# [891, 12]

print[df.shape[0]]
# 891

print[df.shape[1]]
# 12

It is also possible to unpack and store them in separate variables.

  • Unpack a tuple and list in Python

row, col = df.shape
print[row]
# 891

print[col]
# 12

Get the number of elements: df.size

The total number of elements of pandas.DataFrame is stored in the size attribute. This is equal to the row_count * column_count.

print[df.size]
# 10692

print[df.shape[0] * df.shape[1]]
# 10692

Notes when specifying index

When a column of data is specified as an index by the set_index[] method, these columns are removed from the data body [values attribute], so it is not counted as the number of columns.

df_multiindex = df.set_index[['Sex', 'Pclass', 'Embarked', 'PassengerId']]

print[len[df_multiindex]]
# 891

print[len[df_multiindex.columns]]
# 8

print[df_multiindex.shape]
# [891, 8]

print[df_multiindex.size]
# 7128

See the following article for set_index[].

  • pandas: Assign existing column to the DataFrame index with set_index[]

Get the number of elements of pandas.Series

As an example of pandas.Series, select one row from pandas.DataFrame.

s = df['PassengerId']
print[s.head[]]
# 0    1
# 1    2
# 2    3
# 3    4
# 4    5
# Name: PassengerId, dtype: int64

Get the number of elements : len[s], s.size

Since pandas.Series is one-dimensional, you can get the total number of elements [size] with either len[] or size attribute.

Note that the shape attribute is a tuple with one element.

print[len[s]]
# 891

print[s.size]
# 891

print[s.shape]
# [891,]

There is no info[] method in pandas.Series.

How do I get the length of a column in Python?

Get the string length of the column – python pandas.
Syntax for string len[] function in python:.
Example of get the length of the string of column in a dataframe in python:.
Create dataframe:.
Example 2 – Get the length of the integer of column in a dataframe in python:.

How do I print the length of a Dataframe in Python?

Get the number of rows: len[df] DataFrame can be obtained with the Python built-in function len[] . In the example, it is displayed using print[] , but len[] returns an integer value, so it can be assigned to another variable or used for calculation.

How do you print column data in Python?

3 Easy Ways to Print column Names in Python.
Using pandas. dataframe. columns to print column names in Python. ... .
Using pandas. dataframe. columns. ... .
Python sorted[] method to get the column names. Python sorted[] method can be used to get the list of column names of a dataframe in an ascending order of columns..

How do you print the number of rows and columns in Python?

len[] method is used to get the number of rows and number of columns individually.

Chủ Đề