How to Fix Archive.Org Thumbnails

archive_xml_repair_01

One of the annoyances of Archive.org is that thumbnails seem to be assigned randomly instead of properly displaying the cover. If you Google for instructions, you will find them needlessly complex. Here’s the nearly painless way to fix them. After you fix one or two, it will be easy. Almost so easy that a computer could do it.

archive_xml_repair_02

In this example, we’re going to fix the thumbnail for the June 1976 issue of “Private Screenings.”

The first step is to get to the XML file that is causing the problem. Under the list of files, go to SHOW ALL.

archive_xml_repair_03

The file name will be whatever URL you have chosen for your file, ending in _scandata.xml.

archive_xml_repair_04
Right-click on the file and save it, being sure to rename it from XXX_scandata.xml to something creative like Backup_XXX_scandata.xml. That way you don’t lose your original in case bad things happen.
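If you would rather let a computer fetch the backup, here is a minimal Python sketch. It assumes the item's files are served under https://archive.org/download/ followed by the item name, and the name "my-item" in the usage note is only a placeholder for whatever URL you chose. The function names are made up for this example.

```python
from urllib.parse import quote
from urllib.request import urlretrieve


def scandata_name(identifier: str) -> str:
    """The item's URL name plus _scandata.xml, as described above."""
    return f"{identifier}_scandata.xml"


def scandata_url(identifier: str) -> str:
    # Assumption: item files live under archive.org/download/<identifier>/.
    name = scandata_name(identifier)
    return f"https://archive.org/download/{quote(identifier)}/{quote(name)}"


def download_backup(identifier: str) -> str:
    """Save the file under a Backup_ prefix and return the local file name."""
    target = f"Backup_{scandata_name(identifier)}"
    urlretrieve(scandata_url(identifier), target)
    return target
```

Calling download_backup("my-item") would save Backup_my-item_scandata.xml in the current folder. Nothing is downloaded until you call it.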

archive_xml_repair_05

Open the file in Notepad. Don’t be scared by the coding. It’s easy.

Here is the first part of the problem. At left, a correctly formulated XML file, and at right, our problem child, which thinks that the cover (<page leafNum="0">) is <pageType>Normal</pageType> when it should be <pageType>Title</pageType>.

Change the pageType to Title.

archive_xml_repair_06

Next (and this is really important) you go down to where Archive has improperly assigned the Title page and change it to Normal. Otherwise the XML file will have two Title pages and that makes Archive.org sad.
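The two edits above (cover becomes Title, the wrongly assigned Title becomes Normal) can also be sketched in Python with the standard library. This is only an illustration: the <book> and <pageData> wrappers in the sample are assumed, the sample is far smaller than a real scandata file, and fix_cover is a name invented for this example.

```python
import xml.etree.ElementTree as ET


def fix_cover(root: ET.Element, cover_leaf: str = "0") -> int:
    """Make the cover leaf the only Title page; return how many pages changed."""
    changed = 0
    for page in root.iter("page"):
        page_type = page.find("pageType")
        if page_type is None:
            continue  # leave pages without a pageType untouched
        if page.get("leafNum") == cover_leaf:
            wanted = "Title"   # the cover should be the Title page
        elif page_type.text == "Title":
            wanted = "Normal"  # demote the wrongly assigned Title page
        else:
            continue           # every other page stays as it is
        if page_type.text != wanted:
            page_type.text = wanted
            changed += 1
    return changed


# Tiny stand-in for a scandata file (assumed layout, for illustration only).
SAMPLE = """<book><pageData>
  <page leafNum="0"><pageType>Normal</pageType></page>
  <page leafNum="1"><pageType>Normal</pageType></page>
  <page leafNum="2"><pageType>Title</pageType></page>
</pageData></book>"""

root = ET.fromstring(SAMPLE)
changed = fix_cover(root)
types = [p.findtext("pageType") for p in root.iter("page")]
print(changed, types)
```

In the sample, leaf 0 is promoted to Title and leaf 2 is demoted to Normal, so two pages change. For a real file you would load it with ET.parse and write it back out afterward. Rewriting through ElementTree may change the file's formatting, so keep the backup.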

archive_xml_repair_07

Save the file, being sure to rename it, deleting the Backup prefix. And if you’re using Notepad, be sure to save it as an XML file and not as Notepad’s default TXT.
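Before uploading, a quick sanity check can catch the usual slips: a leftover Backup prefix, a sneaky .txt extension, broken XML, or more than one Title page. The sketch below assumes the cover is leaf 0, and check_scandata is a hypothetical helper, not anything Archive.org provides.

```python
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path


def check_scandata(path: str) -> list[str]:
    """Return a list of problems found; an empty list means it looks fine."""
    problems = []
    name = Path(path).name
    if not name.endswith("_scandata.xml"):
        problems.append(f"unexpected file name: {name}")
    if name.startswith("Backup_"):
        problems.append("Backup_ prefix still present")
    try:
        root = ET.parse(path).getroot()
    except (ET.ParseError, OSError) as err:
        problems.append(f"could not read as XML: {err}")
        return problems
    titles = [page.get("leafNum") for page in root.iter("page")
              if page.findtext("pageType") == "Title"]
    if titles != ["0"]:  # assumption: the cover is leaf 0
        problems.append(f"Title pages found at leaves {titles}, expected ['0']")
    return problems


# Demo on throwaway files: one saved correctly, one saved as .txt by mistake.
XML = ('<book><pageData><page leafNum="0"><pageType>Title</pageType>'
       '</page></pageData></book>')
with tempfile.TemporaryDirectory() as tmp:
    good = Path(tmp) / "example_scandata.xml"
    bad = Path(tmp) / "example_scandata.xml.txt"
    good.write_text(XML, encoding="utf-8")
    bad.write_text(XML, encoding="utf-8")
    good_report = check_scandata(str(good))
    bad_report = check_scandata(str(bad))
print(good_report, bad_report)
```

The correctly named file comes back with no problems, while the .txt copy is flagged for its file name.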

archive_xml_repair_08
Now navigate to this screen.

archive_xml_repair_09
Click on Edit and select “change the files in my item.”

archive_xml_repair_10
Click on “Add a file.” You can’t delete the bad XML or edit it online. All you need to do is upload the correct one and it will overwrite the bad one.

archive_xml_repair_11
Go for it!

archive_xml_repair_12
After uploading, click on Add files.

The Archive.org Gargantu-Brain will process the new XML file (this may take some time – be patient) and

archive_xml_repair_13
Voila! All fixed!

About lmharnisch

I am retired from the Los Angeles Times
1 Response to How to Fix Archive.Org Thumbnails

  1. BJMe says:

    Tn Thank you.
