ArchiveOrg_Convert_Items_to.../README.md
2024-01-28 02:10:35 +00:00

Overview:

Use the following set of scripts to convert Archive.org items into download URLs, which can then be bulk downloaded using other scripts.

Requirements:

  • Python3
  • IA CLI Tool (internetarchive)

Steps:

  1. Before using the scripts, generate a list of IA items with the IA CLI tool (ia search 'collection:XXXXXX' --itemlist > XXXXX_items.txt). Additional search parameters can be used to filter items as needed.
  2. Create a location to store all of the files you will need for the project.
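The two steps above can also be sketched in Python rather than the shell. This is only an illustration, not part of the repository's scripts: it assumes the `internetarchive` package is installed (it provides `search_items`), and the helper names `search_collection` and `write_itemlist` are hypothetical.

```python
from pathlib import Path


def search_collection(collection):
    """Yield item identifiers for a collection (needs network access)."""
    # Imported lazily so write_itemlist() can be used without the package.
    from internetarchive import search_items

    for result in search_items(f"collection:{collection}"):
        yield result["identifier"]


def write_itemlist(identifiers, output_path):
    """Write one identifier per line, creating the project folder if needed."""
    output_path = Path(output_path)
    output_path.parent.mkdir(parents=True, exist_ok=True)
    # Drop blank entries and surrounding whitespace before writing.
    cleaned = [i.strip() for i in identifiers if i.strip()]
    output_path.write_text("".join(f"{i}\n" for i in cleaned), encoding="utf-8")
    return len(cleaned)
```

With the placeholders from step 1, a call might look like `write_itemlist(search_collection("XXXXXX"), "/tmp/PROJECTNAME/XXXXX_items.txt")`.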

Update the following lines in the script to match your project:

input_file_path = "/tmp/PROJECTNAME/COLLECTIONNAME_archivebot_items.txt"
output_file_path = "/tmp/PROJECTNAME/all_extracted_names.txt"
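For orientation, here is a rough sketch of how an item list such as the one at `input_file_path` might be read and how a download URL could be built from an item identifier and a file name. The helpers `read_identifiers` and `build_download_url` are hypothetical and are not taken from the scripts in this repository. The sketch assumes the `https://archive.org/download/<identifier>/<filename>` URL layout, and the actual scripts may work differently.

```python
from pathlib import Path
from urllib.parse import quote


def read_identifiers(path):
    """Return the non-empty lines of an item list file, stripped of whitespace."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return [line.strip() for line in lines if line.strip()]


def build_download_url(identifier, filename):
    """Build a download URL for one file in one item.

    Assumes the archive.org/download/<identifier>/<filename> layout.
    quote() percent-encodes spaces and other unsafe characters but
    leaves "/" alone, so file names inside subfolders keep their path.
    """
    return f"https://archive.org/download/{quote(identifier)}/{quote(filename)}"
```

For example, `build_download_url("some-item", "my file.txt")` produces a URL ending in `/some-item/my%20file.txt`.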

Notes:

  • TBD