Issue
I am trying to read a CSV file directly from a website. Below is my Python 3 code:
import pandas as pd
url = "https://www.w3resource.com/python-exercises/pandas/plotting/alphabet_stock_data.csv"
data = pd.read_csv(url)
But I got the following error:
---------------------------------------------------------------------------
HTTPError Traceback (most recent call last)
Input In [6], in <cell line: 3>()
1 import pandas as pd
2 url = "https://www.w3resource.com/python-exercises/pandas/plotting/alphabet_stock_data.csv"
----> 3 data = pd.read_csv(url)
File ~/opt/anaconda3/lib/python3.9/site-packages/pandas/util/_decorators.py:311, in deprecate_nonkeyword_arguments.<locals>.decorate.<locals>.wrapper(*args, **kwargs)
305 if len(args) > num_allow_args:
306 warnings.warn(
307 msg.format(arguments=arguments),
308 FutureWarning,
309 stacklevel=stacklevel,
310 )
--> 311 return func(*args, **kwargs)
Any clue? Many thanks.
Solution
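The server most likely rejects the default User-Agent that pandas sends through urllib, which is a common cause of an HTTPError on public websites. You should pass a browser-like User-Agent through the storage_options argument (for HTTP(S) URLs, pandas 1.3.0 and later forwards these key-value pairs as request headers):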
import pandas as pd
url = "https://www.w3resource.com/python-exercises/pandas/plotting/alphabet_stock_data.csv"
storage_options = {'User-Agent': 'Mozilla/5.0'}
df = pd.read_csv(url, storage_options=storage_options)
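If the installed pandas version does not forward storage_options as HTTP headers, the same idea can be sketched with the standard library: download the file with a custom User-Agent, then hand the bytes to read_csv. The helper names build_request and read_csv_with_headers are hypothetical, chosen only for this illustration, and the sketch assumes the server accepts the same 'Mozilla/5.0' value used above.

```python
import io
import urllib.request

# Assumption: the same browser-like header as in the answer above is enough.
DEFAULT_HEADERS = {"User-Agent": "Mozilla/5.0"}


def build_request(url, headers=None):
    """Return a urllib Request carrying the given (or default) headers."""
    return urllib.request.Request(url, headers=headers or DEFAULT_HEADERS)


def read_csv_with_headers(url, headers=None, **read_csv_kwargs):
    """Download a CSV with custom headers, then parse it with pandas."""
    # Imported here so build_request stays usable without pandas installed.
    import pandas as pd

    with urllib.request.urlopen(build_request(url, headers)) as response:
        payload = response.read()
    # read_csv accepts file-like objects, so wrap the raw bytes in a buffer.
    return pd.read_csv(io.BytesIO(payload), **read_csv_kwargs)
```

Calling read_csv_with_headers(url) with the url from the question should then return a DataFrame, provided the User-Agent was indeed the reason for the rejection. If the server still refuses the request, urlopen raises urllib.error.HTTPError, whose code attribute holds the HTTP status.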
Taken from: https://stackoverflow.com/a/68816828/5304366
Answered By - Adrien Pacifico