Data Release Notes: Difference between revisions
Jump to navigation
Jump to search
Version 1.0.1 |
1.0.4 release |
||
| (8 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
== Versioning Format == | |||
The versioning format follows a three-digit structure: X.Y.Z. | |||
* The first digit (X) changes when a major update is introduced, such as changes in the data model. | |||
* The second digit (Y) increments with each new release. | |||
* The third digit (Z) is updated for bug fixes or minor changes. | |||
== Version 1.0.0 == | == Version 1.0.4 == | ||
This release introduces new datasets, cross-references, and bug fixes. | |||
=== Data Updates === | |||
* Added Cancer Genome Interpreter data on cancer biomarkers from MetaKB. | |||
* Added Metabolomics Workbench LOINC data on metabolite biomarkers. | |||
* Added Cell Ontology and Protein Ontology cross-references. | |||
=== Bug Fixes === | |||
* Fixed issue where cookie preferences weren't being saved when selecting "Allow". | |||
== Version 1.0.3 == | |||
This release introduces new cross-references and updates to ensure compatibility with external resources. | |||
=== Data Updates === | |||
* NCBI cross-references added across gene biomarker entries. | |||
* ChEBI cross-references integrated for small molecules and metabolites. | |||
=== Backend and Infrastructure Updates === | |||
* ChEBI API migration: Updated all programmatic links from the legacy SOAP services to the new REST API endpoints, following ChEBI’s platform migration. | |||
** Old services retired 1 September 2025. | |||
** New stable API: [https://www.ebi.ac.uk/chebi/backend/api/docs ChEBI REST API docs] | |||
** New data products and beta interface available at [https://www.ebi.ac.uk/chebi/beta/ ChEBI 2.0]. | |||
== Version 1.0.2 == | |||
=== Data Updates === | |||
* Published updated [https://www.metabolomicsworkbench.org/ Metabolomics Workbench] data. | |||
* Published sample data from the [https://edrn.nci.nih.gov/ Early Detection Research Network]. | |||
=== Backend and Infrastructure Updates === | |||
* <code>evidence_source</code> database names now retain their original casing for accuracy and consistency. | |||
* EDRN identifiers were added to the [https://github.com/clinical-biomarkers/format-converter/blob/main/mapping_data/namespace_map.json namespace map]. | |||
* [https://www.genenames.org/ HUGO Gene Nomenclature Committee] (HGNC) was added to the cross-reference JSON file. | |||
* Fixed an issue where <code>evidence_source</code> values without tags were previously dropped; these are now preserved. | |||
* Added a user-guided spelling correction function to improve data entry quality. | |||
* The TSV-to-JSON converter now automatically checks for header spelling errors. | |||
* Introduced <code>_suggest_header_corrections</code> to flag and propose fixes for misspelled headers. | |||
* Enhanced <code>_stream_tsv</code> with a call to <code>_check_header_spelling</code> to prevent invalid headers from being processed. | |||
== Version 1.0.1 == | == Version 1.0.1 == | ||
=== Data Updates === | |||
* Added <code> xrefs.tsv</code> to the list of datasets. | * Added <code> xrefs.tsv</code> to the list of datasets. | ||
=== Backend & Infrastructure Updates === | |||
* Fixed ID formatting issues in NCBI and UniProt references within <code> oncomx.tsv</code>, removing erroneous spaces (e.g., <code> NCBI: 3288</code> → <code> NCBI:3288</code>) and extraneous text (e.g., <code>"(composition)"</code>). Affected biomarkers included AN6295-1, AN6756-1, AN6728-1, and others. | * Fixed ID formatting issues in NCBI and UniProt references within <code> oncomx.tsv</code>, removing erroneous spaces (e.g., <code> NCBI: 3288</code> → <code> NCBI:3288</code>) and extraneous text (e.g., <code>"(composition)"</code>). Affected biomarkers included AN6295-1, AN6756-1, AN6728-1, and others. | ||
* Merged assessed entity type synonyms. | |||
== Version 1.0.0 == | |||
* BiomarkerKB data portal available with OncoMX, OpenTargets, MarkerDB, ClinVar, PubMed Central Biomarker Gene Set Curation, MW, UniProtKB, GWAS, CIViC biomarker data. | |||
Latest revision as of 16:38, 17 October 2025
Versioning Format
The versioning format follows a three-digit structure: X.Y.Z.
- The first digit (X) changes when a major update is introduced, such as changes in the data model.
- The second digit (Y) increments with each new release.
- The third digit (Z) is updated for bug fixes or minor changes.
Version 1.0.4
This release introduces new datasets, cross-references, and bug fixes.
Data Updates
- Added Cancer Genome Interpreter data on cancer biomarkers from MetaKB.
- Added Metabolomics Workbench LOINC data on metabolite biomarkers.
- Added Cell Ontology and Protein Ontology cross-references.
Bug Fixes
- Fixed issue where cookie preferences weren't being saved when selecting "Allow".
Version 1.0.3
This release introduces new cross-references and updates to ensure compatibility with external resources.
Data Updates
- NCBI cross-references added across gene biomarker entries.
- ChEBI cross-references integrated for small molecules and metabolites.
Backend and Infrastructure Updates
- ChEBI API migration: Updated all programmatic links from the legacy SOAP services to the new REST API endpoints, following ChEBI’s platform migration.
- Old services retired 1 September 2025.
- New stable API: ChEBI REST API docs
- New data products and beta interface available at ChEBI 2.0.
Version 1.0.2
Data Updates
- Published updated Metabolomics Workbench data.
- Published sample data from the Early Detection Research Network.
Backend and Infrastructure Updates
evidence_sourcedatabase names now retain their original casing for accuracy and consistency.- EDRN identifiers were added to the namespace map.
- HUGO Gene Nomenclature Committee (HGNC) was added to the cross-reference JSON file.
- Fixed an issue where
evidence_sourcevalues without tags were previously dropped; these are now preserved. - Added a user-guided spelling correction function to improve data entry quality.
- The TSV-to-JSON converter now automatically checks for header spelling errors.
- Introduced
_suggest_header_correctionsto flag and propose fixes for misspelled headers. - Enhanced
_stream_tsvwith a call to_check_header_spellingto prevent invalid headers from being processed.
Version 1.0.1
Data Updates
- Added
xrefs.tsvto the list of datasets.
Backend & Infrastructure Updates
- Fixed ID formatting issues in NCBI and UniProt references within
oncomx.tsv, removing erroneous spaces (e.g.,NCBI: 3288→NCBI:3288) and extraneous text (e.g.,"(composition)"). Affected biomarkers included AN6295-1, AN6756-1, AN6728-1, and others. - Merged assessed entity type synonyms.
Version 1.0.0
- BiomarkerKB data portal available with OncoMX, OpenTargets, MarkerDB, ClinVar, PubMed Central Biomarker Gene Set Curation, MW, UniProtKB, GWAS, CIViC biomarker data.