Full metadata record

DC Field Value Language
dc.contributor.authorLawler, Robin-
dc.contributor.authorLiu, Yao-Hao-
dc.contributor.authorMajaya, Nessa-
dc.contributor.authorAllam, Omar-
dc.contributor.authorJu, Hyunchul-
dc.contributor.authorKim, Jin Young-
dc.contributor.authorJang, Seung Soon-
dc.date.accessioned2024-01-19T13:32:51Z-
dc.date.available2024-01-19T13:32:51Z-
dc.date.created2022-01-10-
dc.date.issued2021-10-07-
dc.identifier.issn1089-5639-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/116268-
dc.description.abstractIn this study, we propose a novel method of pK(a) prediction in a diverse set of acids, which combines density functional theory (DFT) method with machine learning (ML) methods. First, the DFT method with B3LYP/6-31++G**/SM8 is used to predict pK(a), yielding a mean absolute error of 1.85 pK(a) units. Subsequently, such pK(a) values predicted from the DFT method are employed as one of 10 molecular descriptors for developing ML models trained on experimental data. Kernel Ridge Regression (KRR), Gaussian Process Regression, and Artificial Neural Network are optimized using three Pipelines: Pipeline 1 involving only hyperparameter optimization (HPO), Pipeline 2 involving HPO followed by a relative contribution analysis (RCA) and recursive feature elimination (RFE), and Pipeline 3 involving HPO followed by RCA and RFE on an expanded set of composite features. Finally, it is demonstrated that KRR with Pipeline 3 yields optimal pK(a) prediction at an MAE of 0.60 log units. This algorithm was then utilized to predict the pKa of 37 novel acids. The two most important features were determined to be the number of hydrogen atoms in the molecule and the degree of oxidation of the acid. The predicted pKa values were documented for future reference.-
dc.languageEnglish-
dc.publisherAMER CHEMICAL SOC-
dc.subjectACID DISSOCIATION-CONSTANTS-
dc.subjectDENSITY-FUNCTIONAL METHODS-
dc.subjectSOLVATION FREE-ENERGIES-
dc.subjectCOMPLETE BASIS-SET-
dc.subjectPHOSPHONIC ACID-
dc.subjectPROTON CONDUCTIVITY-
dc.subjectPROTOGENIC GROUP-
dc.subjectNEURAL-NETWORKS-
dc.subjectSULFONIC-ACID-
dc.subjectVALUES-
dc.titleDFT-Machine Learning Approach for Accurate Prediction of pK(a)-
dc.typeArticle-
dc.identifier.doi10.1021/acs.jpca.1c05031-
dc.description.journalClass1-
dc.identifier.bibliographicCitationJOURNAL OF PHYSICAL CHEMISTRY A, v.125, no.39, pp.8712 - 8722-
dc.citation.titleJOURNAL OF PHYSICAL CHEMISTRY A-
dc.citation.volume125-
dc.citation.number39-
dc.citation.startPage8712-
dc.citation.endPage8722-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.identifier.wosid000707037900020-
dc.identifier.scopusid2-s2.0-85116701045-
dc.relation.journalWebOfScienceCategoryChemistry, Physical-
dc.relation.journalWebOfScienceCategoryPhysics, Atomic, Molecular & Chemical-
dc.relation.journalResearchAreaChemistry-
dc.relation.journalResearchAreaPhysics-
dc.type.docTypeArticle-
dc.subject.keywordPlusACID DISSOCIATION-CONSTANTS-
dc.subject.keywordPlusDENSITY-FUNCTIONAL METHODS-
dc.subject.keywordPlusSOLVATION FREE-ENERGIES-
dc.subject.keywordPlusCOMPLETE BASIS-SET-
dc.subject.keywordPlusPHOSPHONIC ACID-
dc.subject.keywordPlusPROTON CONDUCTIVITY-
dc.subject.keywordPlusPROTOGENIC GROUP-
dc.subject.keywordPlusNEURAL-NETWORKS-
dc.subject.keywordPlusSULFONIC-ACID-
dc.subject.keywordPlusVALUES-
Appears in Collections:
KIST Article > 2021
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE