mirror of
https://github.com/end-of-term/eot2024.git
synced 2025-08-17 03:21:46 -07:00
added bulk list from data rescue 2025 inventories (#42)
* Update README.md added bulk list from data rescue 2025 inventories * Add files via upload added bulk list from data rescue 2025 inventories
This commit is contained in:
parent
4f6e19c4fb
commit
81d9e8f745
2 changed files with 127 additions and 0 deletions
|
@ -11,6 +11,12 @@ See [commoncrawl/ccf-eot-seeds-2024](https://github.com/commoncrawl/ccf-eot-seed
|
|||
* ccf-gov-federal-web-graph-2024-jun-jul-aug.txt - all .gov federal hostnames from current-federal.csv domains in CCF's 2024 June/July/August web graph
|
||||
* ccf-mil-web-graph-2024-jun-jul-aug.txt - all .mil hostnames from CCF's 2024 June/July/August web graph
|
||||
|
||||
### Data Rescue inventories
|
||||
|
||||
See [Data Rescue inventories](https://docs.google.com/spreadsheets/d/1OYLn6NBWStOgPUTJfYpU0y0g4uY7roIPP4qC2YztgWY/edit?usp=sharing) for details on the project being coordinated by IASSIST, RDAP, Data Curation Network and other organizations.
|
||||
|
||||
*data-rescue-inventories-20250209.txt. All urls from the Data Rescue inventories.
|
||||
|
||||
### Defenders of Wildlife seeds
|
||||
Seeds submitted by Andrew Carter on behalf of Defenders of Wildlife:
|
||||
* EoT archive submission - DoW 12-19-24.txt
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue