ia_metadata_to_download_links.py | ||
README.md |
Overview:
Use the following set of scripts to process Archive.Org Items into downloadable URL's that can be bulk downloaded using other scripts.
Requirements:
- Python3
- IA CLI Tool (internetarchive)
Steps:
- Before using the above script you will need to generate a list of IA Items usingthe IA CLI Tool (ia search 'collection:XXXXXX' --itemlist > XXXXX_items.txt). Additional paramaters can be used to filter items as needed.
- Create a location to store all of the files you will need for the project.
Update the following lines
input_file_path = "/tmp/PROJECTNAME/COLLECTIONNAME_archivebot_items.txt" output_file_path = "/tmp/PROJECTNAME/all_extracted_names.txt"
Notes:
- TBD