This Python script reads data from an Excel file, processes it, and converts it into JSON format.
- Determine Sheet Name: Automatically detects and uses the first sheet name from the Excel file.
- Identify Tables: Identifies tables within the sheet, even if there are multiple tables.
- Clean Data: Cleans the data by removing empty rows and ensuring column names are properly formatted.
- Convert Date to Words: Converts date strings to a more readable format.
- Evaluate Formulas: Evaluates formulas within the sheet and retrieves their computed values.
- Serialize to JSON: Serializes the tables into a JSON file.
- Python 3.6+
openpyxl
library
-
Clone the repository or download the script file.
-
Navigate to the project directory.
-
Create a virtual environment (optional but recommended):
python3 -m venv .venv source .venv/bin/activate
-
Install the required dependencies:
pip install openpyxl
-
Place your Excel file (e.g.,
example_0.xlsx
) in the project directory. -
Modify the script if needed to change the input file name, sheet name, and output file name:
if __name__ == "__main__": input_excel_file = Path("example_0.xlsx") sheet_name = 'Analysis Output' output_json_file = 'output.json' main(input_excel_file, sheet_name, output_json_file)
-
Run the script:
python3 main_script.py
-
The 'output.json' file will be created in the project directory.