This notebook is meant to automate/simplify downloading the CORD-19 dataset from kaggle.com.
- We rely on Kaggle's Python API. It should be already installed by our requirements.
- In order for it to work you need:
- A Kaggle account (you will need it anyway to access the data).
- Configure your credentials so the API can authenticate (3 mins process).
- Follow instructions here: https://medium.com/@ankushchoubey/how-to-download-dataset-from-kaggle-7f700d7f9198
Making sure that all requirements are installed.
Preparing destination folder
download_cord19_dataset()