
ALU-map: A Natural Language Processing-Based Alu Feature Annotation on the Human Genome by Shreya Sharma

Material type: Text
Publication details: IIT Jodhpur Department of Bioscience and Bioengineering 2023
Description: vii,39p. HB
DDC classification:
  • 572.8 S531A
Holdings
Item type Home library Collection Call number Status Date due Barcode Item holds
Thesis S. R. Ranganathan Learning Hub Reference Theses 572.8 S531A Not for loan TM00540
Total holds: 0

The present study aims to develop a database, "ALU-map", that structures information on the various roles of Alu elements using state-of-the-art Natural Language Processing (NLP) models such as BERT and BioBERT. We evaluated the performance of these models by training them on literature abstracts retrieved from the PubMed database. Each abstract was annotated against 10 biological labels; because a given abstract can carry information related to any of these labels, the task is one of multilabel classification. The study also aims to develop a fine-tuned BERT model that classifies Alu abstracts into all of these categories. Although the fine-tuned models perform well, they have key limitations, which we also discuss. Finally, we constructed a database in which every Alu abstract is annotated against the 10 categories: an abstract is assigned 1 for each category it belongs to and 0 otherwise. The database captures the reported involvement of Alu elements at different levels of biology, including the genetic, transcriptomic, proteomic, and pathway levels, as well as their use as biomarkers. We believe this database can serve as a useful resource for researchers and scientists working in the field.
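
The 0/1 annotation scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the label names are hypothetical placeholders (the abstract does not list the 10 categories), the scores are made up, and the sigmoid-plus-threshold step with a cutoff of 0.5 is an assumed, common way of turning per-label model scores into multilabel decisions.

```python
import math

# Hypothetical placeholder names: the abstract states that there are
# 10 labels but does not name them.
LABELS = [f"label_{i}" for i in range(1, 11)]


def sigmoid(x: float) -> float:
    """Squash a raw score into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def annotate(logits, threshold=0.5):
    """Map one abstract's per-label scores to a 0/1 entry per label.

    Each label is decided independently, so an abstract can receive
    several 1s (or none), which is what makes the task multilabel.
    The threshold of 0.5 is an assumption, not a value from the thesis.
    """
    if len(logits) != len(LABELS):
        raise ValueError("expected one score per label")
    return {
        label: int(sigmoid(score) >= threshold)
        for label, score in zip(LABELS, logits)
    }


# Made-up scores for a single abstract, for illustration only.
example_logits = [2.1, -1.3, 0.4, -3.0, 0.0, 1.7, -0.2, -2.5, 3.2, -0.9]
row = annotate(example_logits)
```

Each resulting `row` corresponds to one database record: one abstract with a 1 or 0 under every category.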
