Full metadata record

DC Field Value Language
dc.contributor.authorJunseok, Kang-
dc.contributor.authorAhn, Sang Chul-
dc.date.accessioned2024-01-19T12:33:39Z-
dc.date.available2024-01-19T12:33:39Z-
dc.date.created2022-04-05-
dc.date.issued2022-02-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/115654-
dc.description.abstractIn the recent research of Variational Prototyping-Encoder (VPE), the problem of classifying 2D flat objects of the unseen class has been addressed. VPE solves this problem by pre-learning the image translation task from real-world object images to their corresponding prototype images as a meta-task. VPE uses a single prototype for each object class. However, in general, a single prototype is not sufficient to represent a generic object class because the appearance can change significantly according to viewpoints and other factors. In this case, using VPE and a single prototype for each class in training can result in overfitting or performance degradation. One solution may be the use of multiple prototypes. However, this also requires costly sub-labeling for dividing the input class into smaller classes and assigning a prototype to each. Therefore, we propose a new learning method, the variational multi-prototype encoder (VaMPE), which can overcome the limitations of VPE and use multiple prototypes for each object class. The proposed method does not require additional sub-labeling other than simply adding multiple prototypes to each class. Through various experiments, we demonstrate that the proposed method outperforms VPE.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleVariational Multi-Prototype Encoder for Object Recognition Using Multiple Prototype Images-
dc.typeArticle-
dc.identifier.doi10.1109/ACCESS.2022.3151856-
dc.description.journalClass1-
dc.identifier.bibliographicCitationIEEE Access, v.10, pp.19586 - 19598-
dc.citation.titleIEEE Access-
dc.citation.volume10-
dc.citation.startPage19586-
dc.citation.endPage19598-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.identifier.wosid000761186700001-
dc.identifier.scopusid2-s2.0-85124826828-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.type.docTypeArticle-
dc.subject.keywordAuthorPrototypes-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorFeature extraction-
dc.subject.keywordAuthorDeep learning-
dc.subject.keywordAuthorPerturbation methods-
dc.subject.keywordAuthorNeural networks-
dc.subject.keywordAuthorDeep learning-
dc.subject.keywordAuthorvariational encoder-
dc.subject.keywordAuthorprototype learning-
dc.subject.keywordAuthorembedding space-
dc.subject.keywordAuthorimage classification-
Appears in Collections:
KIST Article > 2022
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE