Issue with pandas read_excel() function

grady37 · October 29, 2024, 3:47pm

Hi all,

I am working on a lambda cloud jupyter notebook. When trying to load an excel file as a pandas dataframe, I’m getting the error, “ImportError: Missing optional dependency ‘openpyxl’. Use pip or conda to install openpyxl.”

I have tried to solve it by installing openpyxl within the notebook, importing it in my code, etc, but I have yet to find a solution that works.

One thing that I think might be contributing to the issue is the fact that, when installing openpyxl and pandas, they are being installed to the location “/home/ubuntu/.lambda/lib/python3.10/site-packages.” However, my working environment is in “/usr/bin/python3.” I’m not sure if this is relevant, though.

Has anyone had a similar issue? Any ideas on how to potentially fix it?

cody_b · November 2, 2024, 9:38pm

Hi,

Lambda’s engineers are working on a permanent fix.

For a temporary workaround, please run the following command:

echo "c.InteractiveShellApp.exec_lines = ['import sys; sys.path.append(\"/home/ubuntu/.lambda/lib/python3.10/site-packages\")']" \
>> ~/.ipython/profile_default/ipython_config.py

Then, try the notebook again.

Edit: The above command is just a workaround.

Jordan_U · November 20, 2024, 8:18pm

Hi,

As the person at lambda who should have tested this better, I’d like to apologize.

The goal was to have Jupyter installed inside a virtual environment, while still having notebooks and interactive terminals started through Jupyter using the default namespace. On the first attempt, we fell short on that second part.

If you launch an instance now, you should find that !pip install in notebooks works again, and that running “pip list -v | grep home” on a fresh instance returns no results, meaning that the Jupyter install should not conflict with any of the dependencies of your machine learning projects. Also, no amount of "pip install"s in the base environment should prevent the Jupyter service from running as expected.

Sorry again, and @grady37 , if you would like to test this new configuration and our new ARM64 GH200 instances, I’m happy to provide you with 1 hour’s worth of cloud credits to use on GH200 instances.

Just reply with the email address associated with your lambda cloud account, or file a support ticket and mention “I’m @grady37 on the deeptak forums and Jordan said he would add an hour’s worth of GH200 credits to my account” . (so that you’re not publicly posting your email address here)

Topic		Replies	Views
Cannot run jupyter notebooks, installed modules are not found Technical Help	4	89	November 21, 2024
I am used to the Jupyter Notebook	2	1450	January 21, 2021
"notebook" is not a recognized Jupyter subcommand? Technical Help	0	1797	March 4, 2019
Jupyter not displaying Technical Help	3	80	November 24, 2024
Upgrading to later version of Python Deep Learning: Getting Started	9	4719	June 26, 2023

Issue with pandas read_excel() function

Related topics