Skip to content

GitLab

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Sign in
T tools
  • Project overview
    • Project overview
    • Details
    • Activity
    • Releases
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
    • Locked Files
  • Issues 1
    • Issues 1
    • List
    • Boards
    • Labels
    • Service Desk
    • Milestones
    • Iterations
  • Merge requests 0
    • Merge requests 0
  • Operations
    • Operations
    • Incidents
  • Packages & Registries
    • Packages & Registries
    • Package Registry
  • Analytics
    • Analytics
    • Code Review
    • Issue
    • Repository
    • Value Stream
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Members
    • Members
  • Activity
  • Graph
  • Create a new issue
  • Commits
  • Issue Boards
Collapse sidebar
  • SciCat
  • tools
  • Issues
  • #2

Closed
Open
Created Sep 17, 2022 by gerchow_l@gerchow_l

Request: add dataset name to dataset list for archiver

Hi,

When separating the ingestion and archiving step, the archiver produces output like shown below. While I do understand that the ideal case is a folder structure representing the datasets we are far from setting such thing up and rather work with the filelist.txt option pulling files from different subfolders which does the job for us. However, when running the archiver separate (not using the ingestor -autoarchive option) the list of datasets is little to no help to decide if everything should be archived. See an example of two datasets below.

Would be great if more details could be show, i.e. the datasetName field.

Bests, Lars

2022/09/17 01:26:05 Latest version: 1.1.8
2022/09/17 01:26:05 Your version of this program is up-to-date
2022/09/17 01:26:05 You are about to archive dataset(s) to the === production === data catalog environment...
2022/09/17 01:26:05 User authenticated: gerchow_l lars.gerchow@psi.ch
2022/09/17 01:26:05 User is member in following groups: [p19875 a-35484 lars.gerchow@psi.ch]
2022/09/17 01:26:05 Found the following datasets in state archivable: (size=0 datasets are removed)
2022/09/17 01:26:05 Folder: /data/2020/mixe2020_offline, size: 2285798633, PID: 20.500.11935/351710ab-de8d-41a4-b495-933265f08379
2022/09/17 01:26:05 Folder: /data/2020/mixe2020_offline, size: 3000309087, PID: 20.500.11935/44679a14-7c2e-48a6-9cb2-32a12d936856
Assignee
Assign to
None
Milestone
None
Assign milestone
Time tracking