The reusability of public omics data across 5 million research publications

dc.contributor.author MUNTEANU, Viorel
dc.contributor.author DRABCINSKI, Nicolae
dc.contributor.author CIORBA, Dumitru
dc.contributor.author MANGUL, Serghei
dc.contributor.author BOSTAN, Viorel
dc.date.accessioned 2024-12-08T14:01:54Z
dc.date.available 2024-12-08T14:01:54Z
dc.date.issued 2024
dc.identifier.citation MUNTEANU, Viorel; Nicolae DRABCINSKI; Dumitru CIORBA; Serghei MANGUL and Viorel BOSTAN. The reusability of public omics data across 5 million research publications. In: Electronics, Communications and Computing (IC ECCO-2024): The conference program and abstract book: 13th intern. conf., Chişinău, 17-18 Oct. 2024. Technical University of Moldova. Chişinău: Tehnica-UTM, 2024, pp. 182-183. ISBN 978-9975-64-480-8 (PDF). en_US
dc.identifier.isbn 978-9975-64-480-8
dc.identifier.uri http://repository.utm.md/handle/5014/28807
dc.description Only Abstract en_US
dc.description.abstract Publicly accessible omics data are a vital resource for the scientific community, enabling re-analysis, experiments, and meta-analyses that promote reproducibility and fuel new discoveries. Despite their importance, the patterns and extent of secondary data reuse are not well understood. In this comprehensive study, we analyzed over five million open-access publications from 2001 to 2024, identifying 400,000 papers focused on omics data. Among these, 58% of the publications reused publicly available datasets. Notably, from 2016 to 2024, there was a significant 30% increase in publications utilizing reused gene expression data, surpassing the number of studies using newly generated data. For the study, we collected 5,547,235 open-access publications from PubMed Central (PMC), spanning the years 2001 to 2024. We identified 276,642 publications that mentioned omics datasets, such as those from the Sequence Read Archive (SRA) and Gene Expression Omnibus (GEO), using text mining and regular expressions. en_US
dc.language.iso en en_US
dc.publisher Technical University of Moldova en_US
dc.relation.ispartofseries Electronics, Communications and Computing (IC ECCO-2024): 13th intern. conf., 17-18 Oct. 2024;
dc.rights Attribution-NonCommercial-NoDerivs 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.subject reproducibility en_US
dc.subject public omics data en_US
dc.subject data reuse en_US
dc.subject secondary analysis en_US
dc.title The reusability of public omics data across 5 million research publications en_US
dc.type Article en_US
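
The abstract states that dataset mentions were found with text mining and regular expressions but does not give the patterns used. The following is a minimal sketch of that kind of matching, not the authors' implementation. The accession formats (GEO identifiers beginning with GSE, GSM, GDS or GPL; SRA-style identifiers such as SRP, SRR, SRX, SRS and their ERx/DRx counterparts), the function name `find_accessions`, and the sample sentence are all assumptions introduced here for illustration.

```python
import re

# Assumed accession formats; the abstract does not list the actual patterns.
# GEO: series (GSE), samples (GSM), datasets (GDS), platforms (GPL) + digits.
GEO_PATTERN = re.compile(r"\b(?:GSE|GSM|GDS|GPL)\d+\b")
# SRA-style: S/E/D prefix, then R, then P/R/X/S, then at least six digits.
SRA_PATTERN = re.compile(r"\b[SED]R[PRXS]\d{6,}\b")


def find_accessions(text):
    """Return the unique GEO and SRA-style accessions mentioned in text."""
    return {
        "GEO": sorted(set(GEO_PATTERN.findall(text))),
        "SRA": sorted(set(SRA_PATTERN.findall(text))),
    }


# Hypothetical sentence of the kind found in a data availability statement.
sample = (
    "Raw reads are available under SRP123456 (runs SRR1234567 and "
    "SRR1234568); expression matrices were downloaded from GEO (GSE12345)."
)
result = find_accessions(sample)
print(result)
```

Matching an accession only shows that a dataset is mentioned. Deciding whether a publication reused the data or generated it would need additional context that this sketch does not attempt.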


This item appears in the following Collection(s)

  • 2024
    The 13th International Conference on Electronics, Communications and Computing (IC ECCO-2024)

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States
