Skip to content

Command Line usage example commands do not run properly #137

@yxuil

Description

@yxuil

Describe the bug
docs/cli.rst has the load command examples. However, the load command example doesn't run correctly.

To Reproduce
Steps to reproduce the behavior:

  1. Go to 'docs/cli.rst'
  2. copy the load command in the Loading section
    seqrepo --root-directory $SEQREPO_ROOT/master load -n NCBI mirror/ftp.ncbi.nih.gov/refseq/H_sapiens/mRNA_Prot/human.*.gz
  3. paste to shell terminal.
  4. results in error:
    FileNotFoundError: [Errno 2] No such file or directory: 'mirror/ftp.ncbi.nih.gov/refseq/H_sapiens/mRNA_Prot/human.*.gz'

I have tried trim off the leading mirror/ or replace it with ftp:// but both didn't work.

Expected behavior
The CLI document is a few years old, and could use some update. There are additional CLI commands that are not covered. The short description with --help is hard to start with.

I am particularly interested in loading individual sequences in an existing instance. For example transcript NM_001387679.1 doesn't seem to be in the latest data pull 2023-09-28. It would be nice to know how to add it. Looks like the fetch-load is the possible command, but the cli.rst didn't mention this and a few other commands. Tried a few times but none is working:
seqrepo fetch-load -i 2023-09-28 -n NCBI NM_001387679.1

Additional context
I am using seqrepo conjunction with UTA or Cdot for validating variants. Found that in some occasions the transcript ID is annotated in UTA or Cdot but its sequence cannot be retrieved from seqrepo. I am hoping load those few missing transcripts with the command line tools, and looking for similar use cases.

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingdocumentationImprovements or additions to documentation

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions