National Register and the National Archives: Difference between revisions
National Register and the National Archives (view source)
Revision as of 16:59, 31 July 2025
, 1 month agono edit summary
No edit summary |
No edit summary |
||
Line 61: | Line 61: | ||
The digitized versions of the scanned nomination forms include Optical Character Recognition text. It is likely that improved OCR programs exist since first being digitized. A percentage of the 3 million pages could be sampled and run through new OCR technology to see if better results are possible. | The digitized versions of the scanned nomination forms include Optical Character Recognition text. It is likely that improved OCR programs exist since first being digitized. A percentage of the 3 million pages could be sampled and run through new OCR technology to see if better results are possible. | ||
The size of files stored on the NARA website are often much larger than the casual researcher need and can cause sluggishness when opening up more than one PDF on older devices. The roughly 4TB of data could likely be reduced in size without affecting the legibility of the | The size of files stored on the NARA website are often much larger than the casual researcher need and can cause sluggishness when opening up more than one PDF on older devices. The roughly 4TB of data could likely be reduced in size without affecting the legibility of the documents. A project is in progress to compress all files down so they will fit on a 1TB thumb drive or SSD. | ||
Reach out to hello@openpreservation.xyz if you have data analysis ideas about the 76,092 PDFs of National Register nomination forms available from the NARA website. | Reach out to hello@openpreservation.xyz if you have data analysis ideas about the 76,092 PDFs of National Register nomination forms available from the NARA website. |