Skip to content
4 changes: 1 addition & 3 deletions src/schema/core.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -9,9 +9,7 @@ imports:
- attribute_values
- basic_slots
- basic_classes
- portal_emsl
- portal_jgi_metagenomics
- portal_jgi_metatranscriptomics
- portal_user_facility
- portal_mixs_inspired
- portal_sample_id

Expand Down
85 changes: 0 additions & 85 deletions src/schema/portal_emsl.yaml

This file was deleted.

52 changes: 0 additions & 52 deletions src/schema/portal_jgi_metagenomics.yaml

This file was deleted.

48 changes: 0 additions & 48 deletions src/schema/portal_jgi_metatranscriptomics.yaml

This file was deleted.

154 changes: 154 additions & 0 deletions src/schema/portal_user_facility.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,154 @@
name: user_facility_metadata
description: This file defines terms that appear in the 'JGI-Metagenomics', `JGI-Metagenomics Long Read`, `JGI-Metatranscriptomics`, `EMSL` sections
of the NMDC sample metadata submission portal, which is implemented with DataHarmonizer
as of Spring 2022
title: 'NMDC Schema supplement for Submission Portal: User Facility Metadata'
id: https://w3id.org/nmdc/portal/jgi_metagenomics

license: https://creativecommons.org/publicdomain/zero/1.0/

imports:
- portal_enums
- nmdc_types

prefixes:
linkml: "https://w3id.org/linkml/"
nmdc: "https://w3id.org/nmdc/"

default_prefix: nmdc
default_range: string

slots:
rna_collect_site:
description: Provide information on the site your RNA sample was collected from
title: RNA collection site
examples:
- value: untreated pond water
rank: 15
string_serialization: '{text}'
recommended: true
deprecated: true; as of 2025-10-10, NMDC doesn't need to store this JGI metadata slot & it's now captured directly in the submission portal schema to support submissions
rna_isolate_meth:
description: Describe the method/protocol/kit used to extract DNA/RNA.
title: RNA isolation method
examples:
- value: phenol/chloroform extraction
aliases:
- Sample Isolation Method
rank: 16
string_serialization: '{text}'
recommended: true
rna_organisms:
description: List any organisms known or suspected to grow in co-culture, as well
as estimated % of the organism in that culture.
title: RNA expected organisms
examples:
- value: expected to contain microbes (59%) fungi (30%), viruses (10%), tadpoles
(1%)
rank: 14
string_serialization: '{text}'
recommended: true
deprecated: true; as of 2025-10-10, NMDC doesn't need to store this JGI metadata slot & it's now captured directly in the submission portal schema to support submissions
dna_lr_isolate_meth:
description: Describe the method/protocol/kit used to extract DNA/RNA.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm inclined to ask for the description to say something about the goal of extracting DNA fragments of a certain length

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These slots will be captured in mongoDB. That's why I've added them here and removed the ones that are submission portal only.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we won't have records for this slot, it shouldn't be in nmdc-schema.
The information this slot is meant to capture would go on Class: Extraction
Should still update this description to clarify this if the long-read extraction method.

title: DNA isolation method for long read sequencing
examples:
- value: phenol/chloroform extraction
rank: 16
string_serialization: '{text}'
recommended: true
aliases:
- Sample Isolation Method
dna_collect_site:
description: Provide information on the site your DNA sample was collected from
title: DNA collection site
examples:
- value: untreated pond water
rank: 15
string_serialization: '{text}'
recommended: true
deprecated: true; as of 2025-10-10, NMDC doesn't need to store this JGI metadata slot & it's now captured directly in the submission portal schema to support submissions
dna_isolate_meth:
description: Describe the method/protocol/kit used to extract DNA/RNA.
title: DNA isolation method
examples:
- value: phenol/chloroform extraction
rank: 16
string_serialization: '{text}'
recommended: true
aliases:
- Sample Isolation Method
dna_organisms:
description: List any organisms known or suspected to grow in co-culture, as well
as estimated % of the organism in that culture.
title: DNA expected organisms
examples:
- value: expected to contain microbes (59%) fungi (30%), viruses (10%), tadpoles
(1%)
rank: 14
string_serialization: '{text}'
recommended: true
deprecated: true; as of 2025-10-10, NMDC doesn't need to store this JGI metadata slot & it's now captured directly in the submission portal schema to support submissions
emsl_store_temp:
description: The temperature at which the sample should be stored upon delivery
to EMSL
title: EMSL sample storage temperature, deg. C
todos:
- add 'see_also's with link to NEXUS info
comments:
- Enter a temperature in celsius. Numeric portion only.
examples:
- value: '-80'
rank: 4
string_serialization: '{float}'
recommended: true
project_id:
description: Proposal IDs or names associated with dataset
title: project ID
rank: 1
string_serialization: '{text}'
recommended: true
replicate_number:
description: If sending biological replicates, indicate the rep number here.
title: replicate number
comments:
- This will guide staff in ensuring your samples are blocked & randomized correctly
rank: 6
string_serialization: '{integer}'
recommended: true
sample_shipped:
description: The total amount or size (volume (ml), mass (g) or area (m2) ) of
sample sent to EMSL.
title: sample shipped amount
comments:
- This field is only required when completing metadata for samples being submitted
to EMSL for analyses.
examples:
- value: 15 g
- value: 100 uL
- value: 5 mL
rank: 3
string_serialization: '{float} {unit}'
recommended: true
sample_type:
description: Type of sample being submitted
title: sample type
comments:
- This can vary from 'environmental package' if the sample is an extraction.
examples:
- value: water extracted soil
rank: 2
range: SampleTypeEnum
recommended: true
technical_reps:
description: If sending technical replicates of the same sample, indicate the
replicate count.
title: number technical replicate
comments:
- This field is only required when completing metadata for samples being submitted
to EMSL for analyses.
examples:
- value: '2'
rank: 5
string_serialization: '{integer}'
recommended: true
4 changes: 1 addition & 3 deletions src/scripts/check_schema_self_containment.py
Original file line number Diff line number Diff line change
Expand Up @@ -28,10 +28,8 @@
"src/schema/nmdc_subsets.yaml",
"src/schema/nmdc_types.yaml",
"src/schema/nmdc.yaml",
"src/schema/portal_emsl.yaml",
"src/schema/portal_enums.yaml",
"src/schema/portal_jgi_metagenomics.yaml",
"src/schema/portal_jgi_metatranscriptomics.yaml",
"src/schema/portal_user_facility.yaml",
"src/schema/portal_mixs_inspired.yaml",
"src/schema/portal_sample_id.yaml"
}
Expand Down