knowledge-enrichment/.forgejo/workflows
Michał Szczepanik d340ec508c Add enrich publications workflow
This adds a workflow which runs the publication enrichment via doi.org.

Given that the DOI org information will change very rarely, and we don't
(yet) have ways to say "this record is complete / needs no enrichment",
the workflow currently only has a "workflow dispatch" trigger.

Two optional inputs can be specified when dispatching the workflow: list
of PIDs and inbox label. These will limit processing to a subset of
records. Otherwise, all records will be processed.

Properties which can change based on the pool / data model (API URL,
collection name, class names) are kept as env variables to make tweaks
easier.

In the last step (process record), inputs are assigned (export) to
environment variables to avoid issues when the runner is filling them in
(eg. end of line after `<<<` when pids are not provided was a syntax
error). To supply the optional `--incoming label` argument to dtc
get-records, parameter expansion is used (`${parameter:+word}` expands
to nothing if parameter is null or unset, otherwise expansion of word is
used).
2026-03-25 16:09:47 +01:00
..
enrich_publications.yml Add enrich publications workflow 2026-03-25 16:09:47 +01:00
scrape.yml run workflow weekly on sunday at 3pm 2026-03-04 09:08:37 +01:00