Full metadata record

DC Field Value Language
dc.contributor.authorBae, Suyeon-
dc.contributor.authorJeon, Mingyu-
dc.contributor.authorMoon, Hoi Ri-
dc.date.accessioned2025-07-29T05:00:11Z-
dc.date.available2025-07-29T05:00:11Z-
dc.date.created2025-07-28-
dc.date.issued2025-07-
dc.identifier.issn1359-7345-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/152862-
dc.description.abstractThe rapid expansion of metal-organic framework (MOF) literature presents both a rich resource and a significant challenge for knowledge extraction. Text mining, which enables the conversion of unstructured scientific texts into structured, machine-readable data, has emerged as a key tool for accelerating data-driven research in the MOF domain. This review traces the development of text mining approaches in MOF research, from early manual curation and rule-based methods to recent breakthroughs powered by large language model (LLM)-based automation. We discuss the foundational role of natural language processing (NLP) and machine learning (ML) techniques such as named entity recognition and vector embedding models, followed by an in-depth analysis of LLM-based frameworks that enable flexible, scalable, and context-aware information extraction. Additionally, we introduce and compare their accuracy, and explore their diverse applications-including prediction of synthesizability, materials properties, and thermal stability. We conclude with a perspective on future directions for text mining in MOF research, including its integration into interactive graphical user interfaces, autonomous laboratories, multi-agent AI systems, and multi-modal LLM frameworks that can process textual, visual, and structural information in a unified way. This review aims to provide a foundational understanding for both experimental and computational researchers interested in adopting or advancing text mining methods in the MOF field.-
dc.languageEnglish-
dc.publisherRoyal Society of Chemistry-
dc.titleText mining in MOF research: from manual curation to large language model-based automation-
dc.typeArticle-
dc.identifier.doi10.1039/d5cc02511g-
dc.description.journalClass1-
dc.identifier.bibliographicCitationChemical Communications, v.61, no.60, pp.11083 - 11094-
dc.citation.titleChemical Communications-
dc.citation.volume61-
dc.citation.number60-
dc.citation.startPage11083-
dc.citation.endPage11094-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.identifier.wosid001522367100001-
dc.identifier.scopusid2-s2.0-105009932119-
dc.relation.journalWebOfScienceCategoryChemistry, Multidisciplinary-
dc.relation.journalResearchAreaChemistry-
dc.type.docTypeReview-
dc.subject.keywordPlusORGANIC FRAMEWORK SYNTHESIS-
Appears in Collections:
KIST Article > Others
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE