25 lines
768 B
Markdown
25 lines
768 B
Markdown
<b>Overview:</b>
|
|
|
|
Use the following set of scripts to process Archive.Org Items into downloadable URL's that can be bulk downloaded using other scripts.
|
|
|
|
<b>Requirements:</b>
|
|
<ul>
|
|
<li>Python3</li>
|
|
<li>IA CLI Tool (internetarchive)</li>
|
|
</ul>
|
|
|
|
<b>Steps:</b>
|
|
|
|
1. Before using the above script you will need to generate a list of IA Items usingthe IA CLI Tool (ia search 'collection:XXXXXX' --itemlist > XXXXX_items.txt). Additional paramaters can be used to filter items as needed.
|
|
2. Create a location to store all of the files you will need for the project.
|
|
|
|
Update the following lines
|
|
|
|
input_file_path = "/tmp/PROJECTNAME/COLLECTIONNAME_archivebot_items.txt"
|
|
output_file_path = "/tmp/PROJECTNAME/all_extracted_names.txt"
|
|
|
|
<b>Notes:</b>
|
|
|
|
<ul>
|
|
<li>TBD</li>
|
|
</ul> |