-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Link between SE Publication and ORA Object not maintained (was: 2 ORA records linked to one SE object) #145
Comments
The issue regarding 1012237 | uuid_74db6297-62a1-41c4-9210-3915a0cb90ff on Oxris-QAThe publication 1012237 on Oxris QA appears to have a link to ORA object uuid_74db6297-62a1-41c4-9210-3915a0cb90ff. This is visible on the publication page. At some point, the title and file information have been harvested from ORA. The record is present in ORA4-QA-SYMP, and is correctly listed in the OAI-PMH response page. Sword requests to the record return valid and correct responses. However, we cannot harvest the ORA record in Elements. The manual update method (via the Publications page) does not update the record; requesting a full harvest does not update the record. A search of every ORA/Hyrax Repotool2 log from the last three months reveals no entry for either uuid_74db6297-62a1-41c4-9210-3915a0cb90ff or 1012237 in any log. There are entries for 1012230, 1012231, 1012232, 1012234, 1012235 and 1012239. The ORA binary file was, for a time, deleted on ORA-SYMP due to an update error. It is possible that Oxris attempted to harvest the record, only to discover that there was no file present, and took some form of action. There is no record of this that I can find. The object has now been fixed and there is now a file attached to the record - although the status of 'complete (file available)' is probably not correct. The obvious solution is to re-deposit a file and see if that re-establishes the link. However, I don't want to try that in case that transparently fixes the issue (for this record) without us understanding what is happening here. (edited) [Note: it appears that a duplicate record is created in ORA] |
Mentions of 1012115 in the logs
Mentions of uuid_06a27c8c-bb1d-40f8-adfd-a0762c38d6cb in the logs
Mentions of uuid_2485bb48-d745-403b-acb7-e8de984f0065 in the logs
|
For 1012115:
|
1012237 | uuid_74db6297-62a1-41c4-9210-3915a0cb90ff
|
@AndrewBennet action points
|
I have reproduced the behaviour where clicking the full text tab will not attempt to re-fetch an item which the system previously thought was deleted. This is questionable behaviour; next I will look into whether we could adjust this. It's too late to make it into our next release, though, which is out next Thursday. However, I did observe that items are brought back into a non-deleted state after a harvest was run. (At least, that was the case for DSpace and EPrints repositories which I tested on). I wonder, could you trigger some change on the item in ORA, so that it gets picked up in the next request for changes to the OAI-PMH endpoint? Alternatively, you could trigger the Full Harvest (there should be a button to do so in the data source configuration page). Then, let's see whether the deleted item gets woken up. |
Is there any chance of a patch release to fix this in 5.18? I realise this might be easier to ask for than to do :) Re: bringing an item into a non-deleted state - dp you want me to:
|
@tobypitts tests to run (we can do this without WeiHsi's updates)
Error recorded, record is unchanged:
Object updates and the link is fixed
|
Tom - your numbered sequence sounds ideal for testing this behaviour. That's essentially the same sequence I ran through (albeit against DSpace and Eprints repositories, since I don't have a test Hyrax), and the harvest did indeed bring the item back into a non-deleted state. I'll discuss with the team about changing the behaviour of the Full Text tab click - though there may be some technical complications around that. We're also coming up to the final release of the year (5.19 - this Thursday), so we probably won't be able to assess the feasibility of the change until after then. |
I think we've narrowed this down now. So long as there's something in the harvest that enables the two objects to reconnect, we can fix a deleted link. But we need the harvest to do that. I'll leave this open until your update regarding making the deletion of the connection less 'touchy', but I'm moving to post-release development if that's OK with you (given when your update will arrive) |
We have two issues regarding a probable broken link between SE Publications and ORA Objects.
For both issues, the likely cause is that a file was apparently deleted in ORA ('apparently' because the file was not removed, but the fileset id changed [1]). This appears to have severed the link between the SE Publication and the ORA Object, maintained by RT2.
We have not tried re-running the SE Migration tool to re-import a pubs-id/uuid pair into SE to re-establish the link.
[1] we had an issue yesterday where objects moved within the ORA system did not have the ids of their filesets preserved on transfer. This has been fixed in code an awaits deployment to ORA
The text was updated successfully, but these errors were encountered: