| # Instruction for downloading data from the sft-data repository. | |
| First, you would want to log in and access the huggingface data through using | |
| ```py | |
| from huggingface_hub import login | |
| login() | |
| ``` | |
| Then, you could either download the zip file of the all the sft data folders, which would look like | |
| ```py | |
| from huggingface_hub import hf_hub_download | |
| hf_hub_download(repo_id="LEVI-Project/sft-data", filename="sft-data.zip") | |
| ``` | |
| Notice that the `sft-data.zip` file above has the following structure: | |
| ``` | |
| sft-data | |
| βββ README.md # This README file. | |
| βββ alf # Folder for ALFWORLD. | |
| β βββ alfworld.json # The JSON file for ALFWORLD. | |
| β βββ alf_data_folder # Folder for the ALFWORLD environment. | |
| β βββ alf_image_id_0 # Folder 0 for ALFWORLD image data. | |
| β βββ alf_image_id_1 # Folder 1 for ALFWORLD image data. | |
| β βββ alf_image_id_2 # Folder 2 for ALFWORLD image data. | |
| β βββ alf_image_id_3 # Folder 3 for ALFWORLD image data. | |
| β βββ alf_image_id_4 # Folder 4 for ALFWORLD image data. | |
| βββ blackjack # Folder for blackjack environment in the `gym_cards`. | |
| β βββ blackjack_data_folder # Folder for blackjack image data. | |
| β βββ blackjack.json # The JSON file for blackjack. | |
| βββ ezpoints # Folder for ezpoints environment in the `gym_cards`. | |
| β βββ ezpoints_data_folder # Folder for ezpoints image data. | |
| β βββ ezpoints.json # The JSON file for ezpoints. | |
| βββ points24 # Folder for points24 environment in the `gym_cards`. | |
| β βββ points24_data_folder # Folder for points24 image data. | |
| β βββ points24.json # The JSON file for points24. | |
| βββ numberline # Folder for numberline environment in the `gym_cards`. | |
| βββ numberline_data_folder # Folder for numberline image data. | |
| βββ numberline.json # The JSON file for numberline. | |
| ``` | |
| Also, you could choose to download the files for any environment out of the five ones. For example, you should be using the following code for downloading data from blackjack. | |
| ```py | |
| from huggingface_hub import hf_hub_download | |
| hf_hub_download(repo_id="LEVI-Project/sft-data", filename="blackjack.zip") # zip folder for image data folder | |
| hf_hub_download(repo_id="LEVI-Project/sft-data", filename="blackjack.json") # JSON file | |
| ``` | |
| For ALFWORLD, notice that the zip file for the image data folder is `alf_data_folder.zip`. |