A Monte Carlo and Kernel Density Estimation based virtual sample generation method for small data modeling problem

Qun Xiong Zhu, Zhi Hui Wang, Yan Lin He, Yuan Xu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

In early industrial production, due to the limited resources, enterprises need to use the limited data to analyze the production status and product quality in order to reduce the waste of resources and funds. This requires building a model with high accuracy. Due to the small amount of data, the accuracy of the model based on small samples is low. The technology of generating virtual sample is often used, according to the information interval between sample data to fill in it with an effective way to expand the amount of sample data. A novel kernel density estimation based on distribution with sample output variables is proposed. Monte Carlo sampling is used to fill the gap between sample distribution and realize the uniform distribution of samples. Combined with Bagging-RBF neural network and bat algorithm (BA), effective virtual samples are generated. Two experiments, MLCC and PTA, show that the virtual samples are more effective.

Original languageEnglish
Title of host publicationProceedings - 2020 Chinese Automation Congress, CAC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1123-1128
Number of pages6
ISBN (Electronic)9781728176871
DOIs
Publication statusPublished - 6 Nov 2020
Externally publishedYes
Event2020 Chinese Automation Congress, CAC 2020 - Shanghai, China
Duration: 6 Nov 20208 Nov 2020

Publication series

NameProceedings - 2020 Chinese Automation Congress, CAC 2020

Conference

Conference2020 Chinese Automation Congress, CAC 2020
Country/TerritoryChina
CityShanghai
Period6/11/208/11/20

Keywords

  • bat algorithm
  • energy prediction
  • kernel density estimation
  • Monte Carlo
  • neural network
  • virtual sample generation

Fingerprint

Dive into the research topics of 'A Monte Carlo and Kernel Density Estimation based virtual sample generation method for small data modeling problem'. Together they form a unique fingerprint.

Cite this