dc.contributor.author | BAS, Albert | |
dc.contributor.author | MUNTEANU, Viorel | |
dc.date.accessioned | 2024-10-10T10:07:04Z | |
dc.date.available | 2024-10-10T10:07:04Z | |
dc.date.issued | 2024 | |
dc.identifier.citation | BAS, Albert and Viorel MUNTEANU. A comprehensive assessment of sequence read archive metadata completeness. In: Conferinţa tehnico-ştiinţifică a studenţilor, masteranzilor şi doctoranzilor = Technical Scientific Conference of Undergraduate, Master and PhD Students, Universitatea Tehnică a Moldovei, 27-29 martie 2024. Chișinău, 2024, vol. 1, pp. 332-339. ISBN 978-9975-64-458-7, ISBN 978-9975-64-459-4 (Vol. 1). | en_US |
dc.identifier.isbn | 978-9975-64-458-7 | |
dc.identifier.isbn | 978-9975-64-459-4 | |
dc.identifier.uri | http://repository.utm.md/handle/5014/27973 | |
dc.description.abstract | Recent advances in high-throughput sequencing technologies have enabled the collection and sharing of vast amounts of omics data along with their associated metadata. Enhancing the availability of this metadata is crucial for ensuring the reusability and reproducibility of raw data and for facilitating novel biomedical discoveries through efficient data reuse. In this study, we performed a comprehensive assessment of metadata completeness by analyzing over 26,000,000 experiments shared in the Sequence Read Archive (SRA) from 2008 to 2023. Our results show that the countries of Central Europe, the USA, and China dominate the generation of sequencing data, accounting for 45%, 16%, and 8% of the total data in the SRA repository, respectively, and that the most frequently used platform is ILLUMINA (90%). We also identified gaps in metadata completeness: the absence of temporary identifiers (5.2%), the lack of an assigned TaxonomyID (5%), and the absence of a library strategy (8%). Our results highlight the urgent need for improved metadata sharing practices and the standardization of reporting. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Universitatea Tehnică a Moldovei | en_US |
dc.relation.ispartofseries | Conferinţa tehnico-ştiinţifică a studenţilor, masteranzilor şi doctoranzilor = Technical Scientific Conference of Undergraduate, Master and PhD Students: Chişinău, 27-29 martie 2024. Vol. 1; | |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
dc.subject | metadata | en_US |
dc.subject | data reusability | en_US |
dc.subject | Sequence Read Archive | en_US |
dc.title | A comprehensive assessment of sequence read archive metadata completeness | en_US |
dc.type | Article | en_US |