datechnoman
  • Joined on 2023-03-10
datechnoman pushed to main at ArchiveTeam/Migrated_ArchiveOrg_CDX_Stats_... 2023-12-27 08:46:40 +00:00
182c58f1ce Implemented subprocess for running multiple json extractions at once
datechnoman pushed to main at ArchiveTeam/Migrated_ArchiveOrg_CDX_Stats_... 2023-12-26 11:59:48 +00:00
8496391064 Update to remove tophosts output
datechnoman created branch main in ArchiveTeam/Migrated_ArchiveOrg_CDX_Stats_... 2023-12-23 04:39:05 +00:00
datechnoman pushed to main at ArchiveTeam/Migrated_ArchiveOrg_CDX_Stats_... 2023-12-23 04:39:05 +00:00
ae515fb425 Upload files to "/"
datechnoman created repository ArchiveTeam/Migrated_ArchiveOrg_CDX_Stats_... 2023-12-23 04:36:14 +00:00
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-22 01:06:34 +00:00
cba96e96e7 Rollback of change
datechnoman transferred repository datechnoman/CommonCrawl_WAT_Path_Comparer to ArchiveTeam/Migrated_CommonCrawl_WAT_Path_... 2023-12-21 09:53:36 +00:00
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-21 09:35:43 +00:00
bfc13cb6ef Updated script to keep regenerating a list of files to download
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-21 01:58:59 +00:00
50e89b9de2 Add in checking for new files once list is depleted
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-20 04:09:50 +00:00
0aad853966 Update README.md
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-20 04:09:17 +00:00
5f152307f2 Add commoncrawl_local_to_share_move.ps1
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-19 00:23:58 +00:00
fd9376cbe0 Updated to extract Pastebin URL's
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-18 04:34:21 +00:00
1036de64a7 Documentation Update
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-18 04:27:32 +00:00
727d2c3187 Update commoncrawl_transfer.ps1
datechnoman pushed to main at ArchiveTeam/CommonCrawl_URL_Processor 2023-12-18 04:27:07 +00:00
171d3e2d2d Upload files to "/"
datechnoman pushed to main at ArchiveTeam/Migrated_CommonCrawl_WAT_Path_... 2023-12-15 10:36:41 +00:00
10c2658bff Update commoncrawl_wat_path_comparer.py
datechnoman created branch main in ArchiveTeam/Migrated_CommonCrawl_WAT_Path_... 2023-12-15 10:35:06 +00:00
datechnoman pushed to main at ArchiveTeam/Migrated_CommonCrawl_WAT_Path_... 2023-12-15 10:35:06 +00:00
7d5b3653c6 Upload files to "/"
datechnoman created repository ArchiveTeam/Migrated_CommonCrawl_WAT_Path_... 2023-12-15 09:05:01 +00:00
datechnoman pushed to main at ArchiveTeam/All_URL_Extractor 2023-12-13 11:32:19 +00:00
112814dd35 Updated script to support .txt files