Workflow: retrieve metadata from Zenodo community

Fetched 2023-01-14 18:30:17 GMT

For a given Zenodo community, retrieve its repository records as Zenodo JSON and (eventually) schema.org JSON-LD and DataCite v4 XML.

children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
community String (Optional)

The short-name of the Zenodo community, e.g. \"ro\" for <https://zenodo.org/communities/ro> Use null for all of Zenodo.

Steps

ID Runs Label Doc
list-ids
../tools/oai-pmh.cwl (CommandLineTool)
oai_pmh ListIdentifiers

List repository identifiers using OAI-PMH protocol

make-uri
../tools/sed.cwl (CommandLineTool)
sed search-replace

Search-replace a stream using regular expressions and other SED commands.

gather-json
../tools/gather-directory.cwl (ExpressionTool)
Gather directory

Gather array of array of Files, flattened into a single Directory. Files are assumed to have unique filenames. Optionally the files can be renamed to add an provided file extension.

chunk-by-line
../tools/split-lines.cwl (CommandLineTool)
split a file into smaller pieces

Output pieces of FILE by splitting into multiple files, e.g. \"xaa\", \"xab\", \"xac\", ... where \"x\" is the default prefix. The filename suffixes are alphabetical in order corresponding to file input, using a suffix letters a-z. The suffix is expanded by 2 characters before exhaustion to \"z\", making increasingly longer filenames that are still ordered correctly, e.g. \"yy\", \"yz\", \"zaaa\", \"zaab\", later \"zyzz\", \"zzaaaa\", \"zzaaab\".

gather-jsonld
../tools/gather-directory.cwl (ExpressionTool)
Gather directory

Gather array of array of Files, flattened into a single Directory. Files are assumed to have unique filenames. Optionally the files can be renamed to add an provided file extension.

gather-datacite4
../tools/gather-directory.cwl (ExpressionTool)
Gather directory

Gather array of array of Files, flattened into a single Directory. Files are assumed to have unique filenames. Optionally the files can be renamed to add an provided file extension.

fetch-zenodo-json
../tools/curl-get-many.cwl (CommandLineTool)
curl: download HTTP files from URLs

curl will download the HTTP/HTTPS files from the given URLs, following any redirections. A list of files is returned, using filenames reflecting the URI path.

split-ids-by-line
../tools/split-by-line.cwl (ExpressionTool)
Split by line

Read file content and split by newline into array of strings.

fetch-zenodo-jsonld
../tools/curl-get-many.cwl (CommandLineTool)
curl: download HTTP files from URLs

curl will download the HTTP/HTTPS files from the given URLs, following any redirections. A list of files is returned, using filenames reflecting the URI path.

fetch-zenodo-datacite4
../tools/curl-get-many.cwl (CommandLineTool)
curl: download HTTP files from URLs

curl will download the HTTP/HTTPS files from the given URLs, following any redirections. A list of files is returned, using filenames reflecting the URI path.

Outputs

ID Type Label Doc
zenodo-json Directory

A directory of Zenodo JSON records retrieved from the given Zenodo community

zenodo-jsonld Directory

A directory of Zenodo schema.org JSON-LD records retrieved from the given Zenodo community

zenodo-datacite4 Directory

A directory of Zenodo DataCitev4 XML records retrieved from the given Zenodo community

Permalink: https://w3id.org/cwl/view/git/5f4b5aab2a9e0fa53eb3d49d62a52d47025a967d/code/data-gathering/workflows/zenodo-records.cwl