Mar 4 - Mar 15, 2024
Report on iteration Mar 4, 2024 — Mar 15, 2024 (includes all issues completed before Mar 18, 2024).
Late summary of activity over the first iteration of March. Quite busy despite the reduced work over spring break.
PPA
- added Ends of Prosody people to the mailing list
- supported Brian/Laure as they work on the excerpts issue
- inventory of NLP work from the fall (part 2 next week)
- acceptance testing for 4 issues (2/4 closed)
- undergrad wrangling
- Collected the version information for the PPA excerpts; found that the vast majority of these volumes were most recently updated in 2022 or later, and that in at least one instance the page range has changed since the excerpts were last checked/verified
- PPA-NLP planning
- Completed the fix for the pairtree files issue (#538) and helped resolve the initial failures in testing
- Began work on upgrading PPA to Solr 9 (#572)
- made progress on various PPA rsync and excerpt-related improvements
- completed work to recalculate page counts for PPA records
- Revised the NLP portion of the PPA Charter
- Organized GitHub issues into epics
- Added all of the EEBO-TCP issues into GitHub
- Tested issues #538 and #567
- Responded to and met with Brian to discuss excerpts fix
- Emailed Jon Stroop who confirmed PUL doesn’t have an easy way to submit things to HT, but suggested we follow up with Esme Cowles who has done it before (apparently it’s only easy if you’re Google or Internet Archive)
AAP
- Obtained Kreike data for local use
- Met with Wangyal to better understand what the shared files are and also gained more general insights about the history of this decades-in-the-making project
- Debriefed with Jeri on my meeting with Wangyal and discussed where this leaves us
Other RSE-related:
- Agile/Scrum/Lean trainings
- Work on project closeout ahead of conversations about GeoTaste and Sim Risk
- RSE team overview of how we create pull requests on GitHub
PM work
- huge headway on Project Design curriculum
- Staff meeting and discussion on new (revamped) programming
- Project Design planning session with Mary
Other CDH / University work
- Got Mallet installed after running into a Java 21 backward-incompatibility issue (David really is the one to thank here)
- Began prepping for Wouter’s classes on topic modeling (this got delayed by the Mallet issue)
- Grant application summaries and write-up
- Lunch, Dinner, Talk, and Dinner with Jo
- RSE town hall; started planning for a proposal workshop
- attended spring Research Computing Advisory Group meeting and shared notes
- attended intro to PyTorch workshop
- two planning meetings for PUL DEI devops fellowship search committee
- Closing out APRs
Active projects
- geniza (0 points, 1 issue)
- ppa-django (0 points, 6 issues)
Releases
Velocity
Development
0 points, 7 issues. Rolling velocity: 1
Closed issues by project
- geniza (1 point, 10 issues)
- development (1 point, 10 issues)
- As a frontend user, I want to see dating information displayed on document details when available, so that I can find out the time frame of a document when it is known. 🆕 enhancement, performant
- As a content editor, I would like to see Historic Shelfmark on the Document edit page, to ensure that my work is correct when working with old scholarship. (1) 🆕 enhancement, performant
- As a content admin, I want a provenance field on the document detail page so that I can note the origin and acquisition history of fragments when available. 🆕 enhancement, performant
- As a content editor, I want there to be a notes field in the places pages so that I can add more detail about places that are hard-to-find. 🆕 enhancement, performant
- eScriptorium line-level ingest and editor 🛠️ chore
- As a content admin, I want to drop down a pin on a map and then be able to move the pin around so that I can manually adjust the coordinates of a place before saving the location. 🆕 enhancement, performant
- As a content editor, I want clearer help text for the name field of the person page so I know how best to present people's names on their pages 🆕 enhancement, performant
- Invalid lat/long coordinates are allowed for Places, but don't persist 🐛 bug, performant
- As a content editor, I want to record places-to-places relationship on the place page and on the document detail page, so that I can track ambiguity. 🆕 enhancement, performant
- Bracket and other character search is functioning unpredictably 🐛 bug, performant
- ppa-django (0 points, 6 issues)
- development (0 points, 6 issues)
- Get the METS-XML and pairtree data in a shareable form so that other Princeton researchers can use it chore
- Thumbnails and snippets occasionally mismatched bug
- Discrepancy between total works / pages as reported by database and solr bug
- Adding works that don't have pairtree prefixes/version files in the directory causes 500 error bug
- update replication playbook to include Solr data chore
- check whether hathitrust rsync update improves ocr chore