Many resources are needed to download a project. Please understand that we have to compensate our server costs. Thank you in advance. Project price only 1 $
You can buy this project and download/modify it how often you want.
Root element of the document identifying it as an HML message. Must
contain the version of HML that the modeled data in this document
uses.
Children:
---------
- property (optional, qty: 0 or more)
- hmlid (optional, qty: 1)
- reporting-center (optional, qty: 1) - Required for NMDP samples
- sample (required, qty: 1 or more)
- typing-test-names (optional, qty: 0 or more)
Attributes:
-----------
- version: Version of HML the document follows (required)
- project-name: Name of the typing project (optional)
Expected to be '1.0.1' to use this version of the HML schema.
This element identifies the entity/organization sending this HML data.
If included, must contain a unique ID identifying the sender as well as
a context which defines to whom the ID is meaningful or the source
of the ID.
This element is required for NMDP transactions and if context is not
included, is assumed to be "NMDP".
Attributes:
-----------
- reporting-center-id: (required) Unique id of reporting center
like "789".
- reporting-center-context: (optional) Source of the reporting center ID
like "NMDP". To whom the ID is meaningful.
Specifies a unique identifier for this HML document. This id follows
the HL7 standard for uniqueness using a two-part key. 'root' is the
unique organization identifier publicly registered for your organization.
'extension' is the unique document id managed internally for your
organization, but must be unique and identify this specific HML document.
Together root and extension guarantee global uniqueness.
http://www.oid-info.com/faq.htm
http://www.hl7.org/oid/index.cfm
Attributes:
-----------
- root: Unique publicly registered identifier for the HML
creator's organization.
(ex: NMDP HL7 id is "2.16.840.1.113883.3.1470")
Format is expected to be a string of digits and dot
delimiters. (required)
- extension: A unique document identifier managed internally by the
organization specified in 'root'. Can be any alpha-numeric
format desired by the organization. (ex: "hml-0.9.7-123456789.23a")
(optional and must be at least 1 non-whitespace character
long)
NOTE - If extension is NOT included, the unique document
identifier is expected to be appended at the end of the root
identifier above in accordance with HL7 practices.
Unique publicly registered identifier for the HML creator's organization.
This can be an HL7 compliant field (ex: NMDP is "2.16.840.1.113883.3.1470")
Format is expected to be a string of digits and dot delimiters. (required)
A unique document identifier managed internally by the
organization specified in 'root'. Can be any alpha-numeric
format desired by the organization. (ex: "hml-0.9.7-123456789.23a")
(required and must be at least 1 non-whitespace character long and
guarantee uniqueness)
Specifies a list of test names internally referenced by an "sso"
element or an "ssp" element. It wraps a list of "typing-test-name"
elements, which contain the test identifiers.
Children:
---------
- typing-test-name: (required, qty: 1 or more)
Attributes:
-----------
- test-id: (required, qty: 1) Reference identifier (unique string)
internal to the document used for referencing the list of
tests contained here with a typing-method.
Specifies a single test name contained in a referenced "typing-test-
names" list. Typing tests may be referenced by other elements
including SSO, SSP, etc.
Attributes:
-----------
- name: (required) Fully qualified test name
(ex: "L999.K1.V1.A9F-S11", "L999.K1.V1.SSP12345")
Allows the optional inclusion of key-value pairs (not
defined explicitly by the schema) without the need to extend or change
the schema. Allows children to be extensible for custom use.
Any information contained in this element must be fully understood by
the message recipient.
Attributes:
-----------
- name: (required) "key" in the name-value pair
- value: (optional) "value" in the name-value pair
Encloses the genotyping data pertaining to a particular sample. It may
contain multiple typing elements (for instance, one for each locus).
Children:
---------
- property: (optional, qty: 0 or more)
- collection-method: (optional, qty: 0 or 1) - Free-form text such as
"swab", "filter paper", and "blood aliquots".
- typing: (required, qty: 1 or more)
Attributes:
-----------
- id: (required) Identifier for the sample (ex: "1234-5678-9", "123456789")
- center-code: (optional) Center code of the sample's origin (donor center,
transplant center, etc.)
Encapsulates the primary data from a genotyping method with an
optional genotyping result (allele-assignment) determined from the
primary data and/or optional consensus sequences.
Children:
---------
- property: (optional, qty: 0 or more)
- allele-assignment: (not required, qty: 0 or more) Also known as interpretation.
- typing-method: (required, qty: 1 or more)
The 'typing-method' element encapsulates methods such as sso, ssp,
sbt-sanger, and sbt-ngs.
- consensus-sequence: (optional, qty: 0 to many) Consensus data for the
results reported under typing-method.
Also allows an optional "property" element that may have nested/custom
use data related to the interpretation.
Attributes:
-----------
- gene-family: Represents the gene evaluated in this typing report, e.g.
"HLA" or "KIR" (required)
See: http://www.genenames.org/genefamilies for examples.
- date: Typing/testing date for this sample (required)
Specifies the genotyping call at the most specific level possible.
This call can be represented within haploid elements or using gl-
resources. When reporting data using haploid, typical use is one
or two haploid elements for a particular locus, but possibly more if
multiple loci are covered (ex: two DRB1 haploids + one DRB3 haploid).
Children:
---------
- property: (optional, qty 0 to many) Custom use properties.
- haploid: (optional, qty: 1 or more)
- genotype-list: (optional, qty: 0 or more)
- glstring: (optional, qty: 0 ro more)
Allows an optional "property" element that may have nested/custom
use data related to the allele-assignment/interpretation.
Expects at least one of haploid, genotype-list, or glstring.
Attributes:
-----------
- date: Date on which the typing was carried out, or on which the
final call was determined. Format can be either ISO-8601 or
"YYYY-MM-DD". (required)
- allele-db: Database or other source for nomenclature used in the
interpretation. (ex: "IMGT-HLADB") (optional, but required for NMDP use)
- allele-version: A specific version of the allele-db (ex: "3.18.0").
(optional, but required for NMDP use)
Specifies one-half of a full typing at a particular locus. Must
conform to the database specified in allele-assignment/interpretation.
Attributes:
-----------
- locus: Locus (ex: "HLA-A", "HLA-DRB1") (required)
- method: Typing method used (ex: "DNA", "SER") (required)
- type: Allele/code level type (ex: "01:01", "01:AB") (required)
Specifies a resource in Genotype List String (GL String) format for the
interpretation of a typing result, or a URI identifying a resource in
GL String format. For more details about the format and use of GL Strings,
see (http://www.ncbi.nlm.nih.gov/pubmed/23849068)
* glstring is expected to EITHER contain inline character data OR a
URI reference to a location that defines/specifies the glstring data.
Attributes:
-----------
- uri: Specifies a URI identifying a resource in GL String format for the
interpretation of a typing result. For more information about the format
and use of GL Strings, see (http://www.ncbi.nlm.nih.gov/pubmed/23849068).
(optional)
Data:
-----
- resource in GL String representation (string, required)
A genotype-list represents a full unambiguous list of possibilities for
the typing of a sample (NOTE: This element and its children were deprecated
in HML 1.0). The values of the elements in this genotype-list (each allele
element) should conform to the nomenclature specified by the
allele-assignement/interpretation.
Children:
---------
- diploid-combination (required, qty: 1 or more)
A diploid-combination element is one possibility value in a genotype-
list (NOTE: This element and its children were deprecated in HML 1.0).
There may be either one or two locus-block child elements,
depending on whether the data provided in this diploid-combination
covers one or two chromosomes.
Children:
---------
- locus-block (required, qty: 1 or 2)
A locus-block element allows allele-list elements to be grouped
together to mean one allele-list is a possibility if and only if all
others are (NOTE: This element and its children were deprecated in HML 1.0).
This is useful, for example, in the case when listing
HLA-DRB1 alleles next to the corresponding HLA-DRB3 alleles that are
relevant in only some cases (example in comments).
Children:
---------
- allele-list (required, qty: 1 or more)
An allele-list element is a representation of the list of allele
possibilities for a genotype (NOTE: This element and its children were
deprecated in HML 1.0). NMDP has historically used allele codes
in combination with allele families to represent this.
Children:
---------
- allele (required, qty: 1 or more)
An allele element specifies a single allele: it should be given in
LOCUS*NAME format and names must be at allele-level resolution (NOTE:
This element and its children were deprecated in HML 1.0). The
value must conform to the nomenclature specified in the interpretation.
Attributes:
-----------
- present: Indicates the presence or absence of this allele. A value
of "N" can be used to indicate that a particular allele was tested
for and found not to be a possibility. A value of "U" (untested)
indicates that the given allele was not tested for. The default
value is "Y". [Y|N|U] (required, qty: 1 or more)
An enumerated type indicating the presence or absence of an allele.
N: a particular allele was tested for and found not to be a
possibility
U: the given allele was not tested for
Y: (default) the given allele was tested for and found to be a
possible result
Must include at least one of sso, ssp, sbt-sanger, and/or sbt-ngs
for the 'typing-method' element.
Specifies an SSO (sequence-specific oligonucleotide) test that was
done for this sample. Kit information and scores must be identified
to allow for later test reinterpretation. For NMDP, a corresponding
typing-test-names/typing-test-name structure is expected in this
same HML document.
Children:
---------
Allows an OPTIONAL "property" element that may have nested/custom
use data related to this typing-method.
Attributes:
-----------
- locus: locus for multi-locus targets (optional)
- test-id: Test ID as registered with the test-id-source.
- test-id-source: A formal or formal test registry location. For
example, this could be the NCBI GTR (specified as "GTR"),
NMDP for tests registered directly with NMDP (specified as
"NMDP), etc. (required if test-id is used)
- scores: The results of the SSO test, specified as one string
(ex: "118111100181")
NMDP allows the following test-id-source values:
(Note that this may change in future versions)
* gtr: ID of kit registered with the NCBI Genetic
Testing Registry. (Preferred)
* nmdp-refid: ID of kit registered with NMDP. The cardinal
sequence numbers of the registered probes in the
kit will determine the score order.
* probe-name: Fully qualified probe name. If this attribute is
used, the scores attribute must contain exactly
one score. (ex: "L0999.K1.V1.A9F-S11")
Specifies an SSP (sequence-specific primer) test that was performed for
this sample. Kit information and scores must be identified to allow for
later test re-interpretation.
Children:
---------
Allows an OPTIONAL "property" element that may have nested/custom
use data related to this typing-method.
Attributes:
-----------
- locus: locus for multi-locus targets (optional)
- test-id: Test ID as registered with the test-id-source.
- test-id-source: A formal or formal test registry location. For
example, this could be the NCBI GTR (specified as "GTR"),
NMDP for tests registered directly with NMDP (specified as
"NMDP), etc. (required if test-id is used)
- scores: The results of the SSP test, specified as one string
(ex: "118111100181")
NMDP allows the following test-id-source values:
(Note that this may change in future versions)
* gtr: ID of kit registered with the NCBI Genetic
Testing Registry. (Preferred)
* nmdp-refid: ID of kit registered with NMDP. The cardinal
sequence numbers of the registered probes in the
kit will determine the score order.
* probe-name: Fully qualified probe name. If this attribute is
used, the scores attribute must contain exactly
one score. (ex: "L0999.K1.V1.A9F-S11")
Describes an SBT (sequence-based typing) that was performed using a
Sanger technique.
Children:
---------
- amplification (required, qty: 1)
- sub-amplification (not required, qty: 0 or more)
- gssp (not required, qty: 0 or more)
Also allows an optional "property" element that may have nested/custom
use data related to this typing-method.
Attributes:
-----------
- locus: The locus for which the SBT was performed. (optional)
- test-id: Test ID as registered with the test-id-source.
- test-id-source: A formal or formal test registry location. For
example, this could be the NCBI GTR (specified as "GTR"),
NMDP for tests registered directly with NMDP (specified as
"NMDP), etc. (required if test-id is used)
NMDP allows the following test-id-source values:
(Note that this may change in future versions)
* gtr: ID of kit registered with the NCBI Genetic
Testing Registry. (Preferred)
* nmdp-refid: ID of kit registered with NMDP. The cardinal
sequence numbers of the registered probes in the
kit will determine the score order.
* probe-name: Fully qualified probe name. If this attribute is
used, the scores attribute must contain exactly
one score. (ex: "L0999.K1.V1.A9F-S11")
Identifies the amplification primer used for SBT-Sanger, and the
resulting sequence from using it.
Attributes:
-----------
- registered-name: Identifies the amplification primer. Must be
recognized by the message recipient. (string, required)
Data:
----
- sequence: IUPAC nucleotide sequence (string, required)
Identifies sub-amplification primers. These primers are used to resolve
ambiguities and may be used either concurrently with or after the
amplification step.
Attributes:
-----------
- registered-name: Identifies the amplification primer. Must be
recognized by the message recipient. (string, required)
Data:
----
- sequence: IUPAC nucleotide sequence (string, required)
Describes the Group Specific Sequencing Primer used.
Attributes:
-----------
- registered-name: Identifies the amplification primer. Must be
recognized by the message recipient. (string, optional)
- primer-sequence: PCR primer sequences used to amplify a polymorphic
region of sequences. (string, optional)
- primer-target: If the primer sequence is proprietary (or otherwise
unable to be explicitly specified), the primer sequence can be imputed
from the gssp result. This imputed primer sequence is specified as the
primer-target. (string, optional)
Data:
-----
- Resulting nucleotide sequence from the GSSP used. (string, required)
Describes an NGS (next-generation sequencing) event that was performed.
Children:
---------
- property: (optional, qty, 0 or more)
- raw-reads (optional, qty: 0 or more)
Also allows an optional "property" element that may have nested/custom
use data related to this typing-method.
Attributes:
-----------
- locus: The locus for which the SBT was performed. (optional)
- test-id: Test ID as registered with the test-id-source.
- test-id-source: A formal or formal test registry location. For
example, this could be the NCBI GTR (specified as "gtr"),
NMDP for tests registered directly with NMDP (specified as
"NMDP"), etc. (required if test-id is used)
NMDP allows the following test-id-source values:
(Note that this may change in future versions)
* gtr: ID of kit registered with the NCBI Genetic
Testing Registry. (Preferred)
* nmdp-refid: ID of kit registered with NMDP. The cardinal
sequence numbers of the registered probes in the
kit will determine the score order.
* probe-name: Fully qualified probe name. If this attribute is
used, the scores attribute must contain exactly
one score. (ex: "L0999.K1.V1.A9F-S11")
The DNA alphabet consists of primary nucleotides (A, C, G, T).
Wildcard IUPAC nucleotides (M, R, W, S, Y, K, V, H, D, B, X, N) may be
used if they are acceptable in the context in which they appear. The
default is to use all upper case letters.
The full specification of the IUPAC codes may be found here:
(http://nar.oxfordjournals.org/content/13/9/3021.short)
Cornish-Bowden A. Nomenclature for incompletely specified bases in
nucleic acid sequences: recommendations 1984. Nucleic Acids Res. 1985;
13:3021-3030.
Attribute:
----------
- xs:anyAttribute: Custom use attribute for additional sequence
information. (optional)
Data:
-----
- Sequence in the DNA alphabet (string, required)
A string of simple nucleotide bases. These are A, C, G and T (DNA).
Used to define reference bases and alternate bases for sequence
variants.
Nucleotide bases representing sequence ambiguity.
Primary nucleotides: A, C, G, T (DNA).
"Wildcard" nucleotides: M, R, W, S, Y, K, V, H, D, B, X, N.
Wildcard nucleotides may be used if they are acceptable in the context
in which they appear. The default is to use all upper case letters.
The full specification of the IUPAC codes may be found here:
(http://nar.oxfordjournals.org/content/13/9/3021.short)
Cornish-Bowden A. Nomenclature for incompletely specified bases in
nucleic acid sequences: recommendations 1984. Nucleic Acids Res. 1985;
13:3021-3030.
The bases of the sequence string are restricted to the upper and lower case
versions of the nucleotides specified above.
Data:
----
- Nucleotide sequence in DNA alphabet (string, required)
Nucleotide bases representing sequence ambiguity.
Primary nucleotides: A, C, G, T (DNA).
"Wildcard" nucleotides: M, R, W, S, Y, K, V, H, D, B, X, N.
Wildcard nucleotides may be used if they are acceptable in the context
in which they appear. The default is to use all upper case letters.
The full specification of the IUPAC codes may be found here:
(http://nar.oxfordjournals.org/content/13/9/3021.short)
Cornish-Bowden A. Nomenclature for incompletely specified bases in
nucleic acid sequences: recommendations 1984. Nucleic Acids Res. 1985;
13:3021-3030.
The bases of the sequence string are restricted to the upper and lower case
versions of the nucleotides specified above.
Data:
----
- Nucleotide sequence in DNA alphabet (string, required)
Describes a sequence that is the result of an alignment or
assembly of shorter sequence reads generated by an NGS platform.
Wraps one or more reference database definitions and one or more
consensus-sequence-blocks.
Children:
---------
- reference-database (required, qty: 1 or more)
- consensus-sequence-block (required, qty: 1 or more)
A consensus sequence contains a sequence of IUPAC nucleotides and novel
variants.
The nucleotides are specified in the DNA alphabet.
The DNA alphabet consists of primary nucleotides (A, C, G, T).
Wildcard IUPAC nucleotides (M, R, W, S, Y, K, V, H, D, B, X, N) may be
used if they are acceptable in the context in which they appear. The
default is to use all upper case letters.
The full specification of the IUPAC codes may be found here:
(http://nar.oxfordjournals.org/content/13/9/3021.short)
Cornish-Bowden A. Nomenclature for incompletely specified bases in
nucleic acid sequences: recommendations 1984. Nucleic Acids Res. 1985;
13:3021-3030.
Children:
--------
- sequence: (required, qty: 1) Nucleotide data for the consensus block.
- variant: (optional, 0 or more) If region-match is false,
variant is expected to refer to the novel-variants.
- sequence-quality: (optional, qty: 0 or more) A score for a sub-sequence
specified by start and end (includes 'start', excludes
'end') that indicates the quality of the read.
Attributes:
----------
- reference-sequence-id: (required) Reference to a unique reference-sequence
defined in this document under "consensus-sequence". IDREF
must exactly match the ID for the reference-sequence.
- start: (required) Start position of a targeted region on contig,
0-based or space-counted coordinate system, closed-open range
- end: (required) End position of a targeted region on contig,
0-based or space-counted coordinate system, closed-open range
- strand: (optional) String value (eg. one of "-1", "1", "-", "+");
defaults to "+" if unspecified
- phasing-group: Phasing group identifier - DEPRECATED. Use "phase-set".
- phase-set: Phase set identifier (string, optional)
- continuity: (optional) True if this represents a continuous read, false
if not continuous.
- expected-copy-number: (optional) Integer for how many copies of
the sequence block were expected (0 to n).
- description: (optional) Text description of the targeted region, like "HLA-A exon 3"
- xs:anyAttribute: Custom use attribute for additional sequence
information. (optional)
Defines the quality for a range within the consensus sequence block.
Attribute:
---------
- start: (required) Sequence start position for quality - inclusive
- end: (required) Sequence end position for quality - not inclusive
- quality-score: (required) Value indicating the quality of the consensus sequence.
Used to indicate a quality-score for a variant/consensus sequence block.
A database reference for the consensus sequence blocks included in the
HML document.
Each reference-database may have 1 or more reference-sequence definitions.
Examples:
Children:
--------
- reference-sequence: (required, qty: 1 or more) Reference sequence for this database.
Attribute:
---------
- name: (optional) Name for this database.
- description: (optional) Description of this database reference.
- version: (optional) Version of this reference database.
- availability: (required) Defines how this reference database is
available ("public", "private", "none").
- curated: (optional) "true" if curated, "false" otherwise.
- uri: External reference for this database.
A sequence reference for the consensus sequence blocks included in the
HML document, associated with a reference-database.
The reference-sequence id must be document-unique and is referenced
by consensus-sequence-block elements via the required
reference-sequence-id attribute.
Examples:
Attribute:
---------
- id: (required) Unique reference for this
database/sequence combination which is referred to
in each consensus-sequence-block.
*Note: XML 'ID' type must begin with a non-symbol,
non-digit, alphabetic character.
- name:
- description: (optional) Description of this database reference.
- start: (required) Start position of a targeted region on contig,
0-based or space-counted coordinate system, closed-open range
- end: (required) End position of a targeted region on contig,
0-based or space-counted coordinate system, closed-open range
- accession: (optional)
- strand: (optional) String value (eg. one of "-1", "1", "-", "+");
defaults to "+" if unspecified
- uri: External reference for this database.
A variant needs to be included for a sequence if consensus-sequence-block
doesn't match a known database, meaning there is some ambiguity in the
submitted sequence of nucleotide bases.
Children:
--------
- variant-effect: (optional) Effect of this variation from the published
sequence.
Attribute:
---------
- id:
- name:
- start: Variant sequence start position - ('0' based).
- end: Variant sequence end position.
- reference-bases: The nucleotide bases from which the reported sequence
differs. (required, A/G/C/T string)
- alternate-bases: The nucleotide bases to substitute for the reported
sequence. (required, A/G/C/T string)
- quality-score: Quality of the variant (optional).
- filter: Values 'PASS' or 'FAIL' as used in VCF format.
- uri: External reference for this variant.
A child of "variant", defines the effect of this variation from the
published sequence.
Sequence Ontology (SO) variant effect terms are specifications of the
sequence_variant (http://sequenceontology.org/browser/current_svn/term/SO:0001060) term.
HGVS (Human Genome Variation Society) information can be found here:
http://www.hgvs.org/mutnomen/disc.html
Additional attributes may be used to provide more information on the effect, for
example severity, POLYPHEN prediction, SIFT score, etc.
Attribute:
---------
- term: (required) Sequence Ontology (SO) term describing the effect, e.g.
"synonymous_variant" (http://sequenceontology.org/browser/current_svn/term/SO:0001819)
"missense_variant" (http://sequenceontology.org/browser/current_svn/term/SO:0001583)
- hgvs: (optional) Human Genome Variation Society variant effect description, e.g.
ENST00000288602.3:c.83T>A
ENSP00000288602.1:p.Val28Glu
- uri: (optional) External reference for this variant effect.
Reports the raw sequence reads generated by an NGS platform. Because
various platforms report reads in various formats, the platform must be
specified. Since this data is quite large even for relatively small
regions of the genome, this information must be linked to using an
external URI.
Attributes:
-----------
- uri: An external link to the raw reads. (required)
- format: Identifies the format of the data located at the URI. (required)
- paired: true/false (default) (required)
- pooled: true/false (default) (required)
- availability: public|private|permission (optional)
- adapterTrimmed: true/false (default) (required)
- qualityTrimmed: true/false (default) (required)