fix #8785 #8786

Merged
merged 4 commits into from
Feb 5, 2024

Collaborator

@hornc hornc commented Feb 1, 2024

Closes #8785

Previous behaviour was:

  • If the archive.org item has an Open Library reference (openlibrary or openlibrary_edition),
    ignore both the require_marc setting and any MARC record present, and import metadata from the archive.org item only.

Importing from the item metadata is the lower-quality option; in the normal case it is used only as a fallback.

I don't see the value in this "if we already know about it, use lower-quality metadata" case.

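This PR makes the /ia import consistent: the MARC record is preferred whenever one exists, whether or not the item already has an Open Library reference.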

OL most likely already has the same MARC metadata (OL is probably where archive.org got its metadata in the first place). Re-importing the same MARC should not create a duplicate, and it will pick up any improvements to the import process, so the result is a proper re-import from the same source.

This should fix #8785, where OL was getting redundant fields because of the different way archive.org combines fields into its description. Now the /ia import path will always prefer the source MARC when it exists.
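
To make the change in behaviour concrete, here is a minimal sketch of the source-selection rule before and after this PR. It is illustrative only and is not the code in the diff: `Source`, `MarcRequiredError`, `choose_source_old`, and `choose_source_new` are hypothetical names, and raising an error when `require_marc` is set but no MARC record exists is an assumption about the fallback path.

```python
from enum import Enum


class Source(Enum):
    MARC = "marc"                # full MARC record attached to the item
    IA_METADATA = "ia_metadata"  # archive.org item metadata (fallback)


class MarcRequiredError(Exception):
    """Hypothetical error: require_marc is set but the item has no MARC."""


def _marc_first(has_marc: bool, require_marc: bool) -> Source:
    # Prefer MARC whenever it exists; otherwise fall back to item
    # metadata only if the caller did not insist on MARC (assumption).
    if has_marc:
        return Source.MARC
    if require_marc:
        raise MarcRequiredError("no MARC record and require_marc is set")
    return Source.IA_METADATA


def choose_source_old(has_marc: bool, require_marc: bool,
                      has_ol_reference: bool) -> Source:
    # Previous behaviour: an existing Open Library reference
    # short-circuits everything, ignoring require_marc and any MARC.
    if has_ol_reference:
        return Source.IA_METADATA
    return _marc_first(has_marc, require_marc)


def choose_source_new(has_marc: bool, require_marc: bool,
                      has_ol_reference: bool = False) -> Source:
    # Behaviour described in this PR: the Open Library reference no
    # longer affects which metadata source is used.
    return _marc_first(has_marc, require_marc)
```

For an item that has both a MARC record and an Open Library reference, `choose_source_old` picks the item metadata while `choose_source_new` picks the MARC record, which is the case #8785 ran into.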

Technical

Testing

Screenshot

Stakeholders

@cdrini
@seabelis
@scottbarnes

@hornc hornc added the Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] label Feb 1, 2024
@mekarpeles mekarpeles merged commit 0e10370 into internetarchive:master Feb 5, 2024
3 checks passed
@mekarpeles mekarpeles self-assigned this Feb 5, 2024
2 participants