Python 函数处理 Excel 数据：从入门到进阶317

Python 凭借其强大的库和易用性，成为处理 Excel 数据的利器。本文将深入探讨如何使用 Python 函数高效地操作 Excel 文件，涵盖从基础的读取写入到高级的数据处理和分析技巧，并提供丰富的代码示例。

在 Python 中，处理 Excel 数据最常用的库是 `openpyxl` 和 `xlrd`、`xlwt` (用于较旧的 .xls 文件)以及 `pandas`。 `openpyxl` 能够读取和写入xlsx格式的文件（Excel 2007 及以后版本），而 `xlrd` 和 `xlwt` 则主要用于处理较旧的 xls 格式文件。 `pandas` 则提供了一个更高层次的抽象，使得数据操作更加便捷高效。我们接下来会主要介绍 `openpyxl` 和 `pandas`，因为它们功能更全面，且更常用于现代的 Excel 文件处理。

一、使用 openpyxl 读取和写入 Excel 文件

openpyxl 提供了简洁的 API 来访问和修改 Excel 工作簿和工作表。以下是一个读取 Excel 文件并打印特定单元格值的例子：```python
from openpyxl import load_workbook
def read_excel_cell(filepath, sheet_name, cell_coordinate):
"""读取 Excel 文件中指定单元格的值。"""
try:
workbook = load_workbook(filepath, data_only=True) # data_only=True 读取计算后的值，而不是公式
sheet = workbook[sheet_name]
cell_value = sheet[cell_coordinate].value
return cell_value
except FileNotFoundError:
return "File not found"
except KeyError:
return "Sheet not found"
except Exception as e:
return f"An error occurred: {e}"
filepath = ""
sheet_name = "Sheet1"
cell_coordinate = "A1"
value = read_excel_cell(filepath, sheet_name, cell_coordinate)
print(f"The value of cell {cell_coordinate} is: {value}")
```

写入 Excel 文件同样简单：```python
from openpyxl import Workbook
def write_excel_cell(filepath, sheet_name, cell_coordinate, value):
"""写入 Excel 文件中指定单元格的值。"""
try:
workbook = Workbook()
sheet =
= sheet_name
sheet[cell_coordinate] = value
(filepath)
return True
except Exception as e:
return f"An error occurred: {e}"
filepath = ""
sheet_name = "Sheet1"
cell_coordinate = "B1"
value = "Hello, world!"
success = write_excel_cell(filepath, sheet_name, cell_coordinate, value)
print(f"Writing to Excel {'succeeded' if success else 'failed'}: {success}")
```

二、使用 pandas 处理 Excel 数据

Pandas 提供了强大的 DataFrame 结构，极大地简化了 Excel 数据的处理。它可以轻松地读取、写入、操作和分析 Excel 数据。```python
import pandas as pd
def process_excel_with_pandas(filepath):
"""使用 pandas 读取、处理和写入 Excel 数据。"""
try:
# 读取 Excel 文件
df = pd.read_excel(filepath)
# 数据处理示例：计算某列的平均值
average_value = df['Column Name'].mean() #替换 'Column Name' 为实际列名
print(f"The average value of 'Column Name' is: {average_value}")
# 数据处理示例：添加新列
df['New Column'] = df['Column Name'] * 2
# 写入修改后的数据到新的 Excel 文件
df.to_excel("", index=False)
return True
except FileNotFoundError:
return "File not found"
except KeyError:
return "Column not found"
except Exception as e:
return f"An error occurred: {e}"
filepath = ""
success = process_excel_with_pandas(filepath)
print(f"Processing Excel {'succeeded' if success else 'failed'}: {success}")
```