2024-01-28 02:10:35 +00:00
ia_metadata_to_download_links.py Update ia_metadata_to_download_links.py 2024-01-28 02:04:01 +00:00
README.md Update README.md 2024-01-28 02:10:35 +00:00

Overview:

Use the following set of scripts to turn Archive.org items into download URLs that can then be bulk downloaded using other scripts.

Requirements:

  • Python3
  • IA CLI Tool (internetarchive)

Steps:

  1. Before using ia_metadata_to_download_links.py, generate a list of IA items using the IA CLI tool (ia search 'collection:XXXXXX' --itemlist > XXXXX_items.txt). Additional parameters can be used to filter items as needed.
  2. Create a location to store all of the files you will need for the project.
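
The two steps above can also be scripted. The following is a minimal sketch, not part of this repository: it assumes the `ia` command is installed and on your PATH, and the helper names `build_search_command` and `write_item_list` are hypothetical.

```python
import subprocess
from pathlib import Path


def build_search_command(collection):
    """Return the argument list for the `ia search` call from step 1."""
    return ["ia", "search", f"collection:{collection}", "--itemlist"]


def write_item_list(collection, project_dir):
    """Create the project directory (step 2) and save the item list into it.

    Assumption: the output file is named <collection>_items.txt, mirroring
    the naming used in step 1.
    """
    project_dir = Path(project_dir)
    project_dir.mkdir(parents=True, exist_ok=True)
    items_path = project_dir / f"{collection}_items.txt"
    with items_path.open("w", encoding="utf-8") as out:
        # Redirect the CLI's stdout into the file, like `> XXXXX_items.txt`.
        subprocess.run(build_search_command(collection), stdout=out, check=True)
    return items_path
```

Calling `write_item_list("XXXXXX", "/tmp/PROJECTNAME")` would run the search and write one item identifier per line. Any extra filters would need to be added to the query string by hand.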

Update the following lines in the script to point at your item list and the desired output file:

  input_file_path = "/tmp/PROJECTNAME/COLLECTIONNAME_archivebot_items.txt"
  output_file_path = "/tmp/PROJECTNAME/all_extracted_names.txt"
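
For orientation, here is a rough sketch of how item identifiers and file names can be combined into download URLs. It is an illustration under stated assumptions, not the logic of ia_metadata_to_download_links.py: the helper names are hypothetical, the file names are assumed to have been obtained separately from each item's metadata, and the URLs follow the public `https://archive.org/download/<identifier>/<filename>` pattern.

```python
from urllib.parse import quote


def parse_identifiers(lines):
    """Return the non-empty, whitespace-stripped identifiers from item-list lines."""
    return [line.strip() for line in lines if line.strip()]


def build_download_urls(identifier, file_names):
    """Build one download URL per file name for a single item.

    File names are percent-encoded; "/" is left intact so that files in
    subdirectories keep their path.
    """
    return [
        f"https://archive.org/download/{identifier}/{quote(name)}"
        for name in file_names
    ]
```

The item list written to `input_file_path` can be passed to `parse_identifiers` by opening the file and handing over the file object, since iterating a file yields its lines.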

Notes:

  • TBD