This is an introduction to openpyxl, a Python module built for reading and writing Excel document files.
Most often, we would like to automate many of our repetitive tasks on Excel using a programming language.
Python's openpyxl module makes it very easy for us to access an excel file.
Note that Openpyxl is not a built-in module, therefore we need to install it first.
Installing Openpyxl
Run the following code in your Python terminal to install Openpyxl.
Pip install openpyxl
The above code installs the openpyxl module after a few minutes.
To use the openpyxl on our in python, we need to import it.
import openpyxl
Loading our excel file can be done by:
from openpyxl import load_workbook
survey = load_workbook("ourworkbook.xlsx")
The code above uses the imported load_workbook method to read the excel file of interest and stores it in variable "survey"
Note that the file we are accessing must be in the same folder we are working from.
Excel document can contain many sheets, to print all the active sheets in the excel file .
print(survey.sheetnames)
The above code will display the name of the active sheets in the excel files.
To access the first sheet in the document:
first = survey.active
We stored the first sheet in the excel document as variable "first"
Rows and columns in excel file
Accessing column A:
col_A = first["A"]
col_A
To access range of columns:
range_A_to_C = first["A:C"]
print(range_A_to_C)
The range of columns from A to C is stored in the variable "range_A_to_C" as it is shown in the above code, we can thus print it out.
There are quite few nice things we can do with the Openpyxl module in Python, and interested person should check Openpyxl documentation for more information.
Openpyxl has been around for quite some time now and has its own usefulness. However, better tools (eg. pandas) has been developed in Python for better handling of Excel file.
Top comments (0)