Please use this identifier to cite or link to this item: http//localhost:8080/jspui/handle/123456789/12105
Full metadata record
DC FieldValueLanguage
dc.contributor.authorFETNI, Atika-
dc.date.accessioned2024-10-15T10:56:53Z-
dc.date.available2024-10-15T10:56:53Z-
dc.date.issued2024-06-08-
dc.identifier.urihttp//localhost:8080/jspui/handle/123456789/12105-
dc.description.abstractUnderstanding the language of non-coding DNA is a major topic in genomic research. Gene regulatory code is extremely complicated due to the presence of polysemy and distant semantic relationships, which earlier informatics approaches frequently fail to capture. To address this difficulty, we used DNABERT, a unique pre-trained bidirectional encoder representation that captures global and transferable comprehension of genomic DNA sequences based on up and downstream nucleotide contexts. We compared DNABERT to the most popular systems for predicting genome-wide regulatory elements and found that it was easier to use, more accurate, and more efficient. We demonstrate that a single pre-trained transformers model can reach state-of-the-art performance in the prediction of promoters, splice sites, and transcription factor binding sites following simple fine-tuning using modest task-specific labeled data. Furthermore, DNABERT allows for direct display of nucleotide-level significance and semantic relationships within input sequences, resulting in improved interpretability and more accurate identification of conserved sequence motifs and functional genetic variant possibilities.en_US
dc.language.isoenen_US
dc.publisherUniversity Larbi Tébessi – Tébessaen_US
dc.subjectDNA, BERT, DNABert, LRM, NLP.en_US
dc.titleBert based DNA pattern recognitionen_US
dc.typeThesisen_US
Appears in Collections:3- إعلام آلي

Files in This Item:
File Description SizeFormat 
Bert based DNA pattern recognition.pdf3,22 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Admin Tools