Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Insights workshop reingestion #2981

Merged
merged 17 commits into from
Mar 26, 2024
Merged

Insights workshop reingestion #2981

merged 17 commits into from
Mar 26, 2024

Conversation

anthology-assist
Copy link
Collaborator

@anthology-assist anthology-assist commented Dec 22, 2023

  1. In the Github sidebar, add workshop to work items and the current milestone
  2. In the Github sidebar, make sure to link to a corresponding PR under "Development"
  3. Make sure the branch is merged with the latest master branch
  4. Ensure that there are editors listed in the <meta> block
  5. If it's a workshop, add a <venue>ws</venue> tag
  6. Add events to their relevant SIGs
  7. Look at the venue listing for prior years, and ensure that the new volume titles are consistent. You can do this by clicking on the venue name from a paper page, which will take you to the vendor listing.
  8. Navigate to the event page preview (e.g., https://preview.aclanthology.org/icnlsp-ingestion/events/icnlsp-2021/), and page through, to see if there are any glaring mistakes
  9. Skim through the complete listing, looking for mis-parsed author names.
  10. Download the frontmatter and verify that the table of contents matches at least three randomly-selected papers
  11. Download 3–5 PDFs (including the first and last one) and make sure they are correct (title, authors, page numbers).

Copy link

github-actions bot commented Dec 22, 2023

Build successful. Some useful links:

This preview will be removed when the branch is merged.

@mjpost
Copy link
Member

mjpost commented Jan 12, 2024

Why is this being reingested? I can't find a corresponding issue. Note that this is very out of sync with master, and there has already been a revision against the first paper (#2883), so we need to take care.

@mjpost mjpost changed the title Workshop insights reingestion Insights workshop reingestion Jan 12, 2024
@anthology-assist
Copy link
Collaborator Author

@mjpost The insights workshop reingestion is requested by one of the EACL organizers. They updated the ingestion material to include more papers.

@mjpost
Copy link
Member

mjpost commented Jan 14, 2024

Can you please merge in master? There are 61 files differing.

@mjpost
Copy link
Member

mjpost commented Jan 14, 2024

Please note when you merge that there is a conflict due to the fact that a revision has been processed for an Insights paper. This needs to be handled carefully against the reingestion since the reingestion may reintroduce the original paper.

@mjpost
Copy link
Member

mjpost commented Jan 14, 2024

There are also videos and DOIs that are overwritten by this reingestion. I think it is going to have to be processed more carefully by a custom script.

@anthology-assist
Copy link
Collaborator Author

@mjpost Maybe I'm missing something, but not seeing overitten DOIs and videos after merge in master.

data/xml/2023.insights.xml Outdated Show resolved Hide resolved
data/xml/2023.insights.xml Outdated Show resolved Hide resolved
@mjpost
Copy link
Member

mjpost commented Jan 31, 2024

@anthology-assist did you look at the Files changed tab (always necessary to page through this for every PR). I marked one of them.

@mjpost mjpost added this to the 2024Q1 milestone Feb 16, 2024
@anthology-assist
Copy link
Collaborator Author

@mjpost I think this is finally ready.

Co-authored-by: Matt Post <post@cs.jhu.edu>
@mjpost mjpost merged commit 4c96d2e into master Mar 26, 2024
2 checks passed
@mjpost mjpost deleted the insights-reingestion branch March 26, 2024 00:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants