ERIM: An ensemble of rare itemset mining and its application in the automotive industry

dc.contributor.authorAkdaş, Devrim Naz
dc.contributor.authorBirant, Derya
dc.contributor.authorTaşer, Pelin Yıldırım
dc.date.accessioned2023-03-22T19:47:26Z
dc.date.available2023-03-22T19:47:26Z
dc.date.issued2022
dc.departmentBelirleneceken_US
dc.description.abstractDiscovering previously unknown anomalies that are rare and dramatically differ from the majority of the data is a critical need for the automotive industry. Rare itemset mining (RIM), one of the pattern-based methods, has been used for anomaly detection due to providing successful analysis results. However, several aspects still need to be explored, such as improving the mining process by identifying more targeted, valuable and reliable rare itemsets. Motivated by this fact, this study proposes a novel approach, named ensemble of rare itemset mining (ERIM), which investigates weak rare itemsets (WRIs) using different algorithms and aggregates these rules to obtain strong rare itemsets (SRIs). This study also combines four different RIM algorithms (Apriori Rare, Apriori Inverse, CORI and RP-Growth) as base learners for the first time. The proposed ERIM approach is a general methodology that can be applied to any field, but, in this study, it was used in the automotive industry as a case study. In the experiments, ERIM was applied to a real-world gear manufacturing dataset to discover anomalies in machine downtimes. The experimental results were evaluated in terms of the number of itemsets and the length of itemsets by giving some samples, as well. The results showed that the proposed ERIM approach gives more reliable common knowledge by jointly considering the relation between WRIs discovered by the base learners. The findings indicated that the proposed ERIM technique was successful in detecting anomalies whose support values are below 7.12. Furthermore, it is clear from the experimental results that the ERIM discovered the highest number of SRIs, 1403, each of which is a 3-itemset. Finally, the results showed that our method performed 43.37% better on average than state-of-the-art methods on the same dataset.en_US
dc.identifier.doi10.1111/exsy.13122
dc.identifier.issn0266-4720
dc.identifier.issn1468-0394
dc.identifier.scopus2-s2.0-85135353967en_US
dc.identifier.scopusqualityQ2en_US
dc.identifier.urihttps://doi.org/10.1111/exsy.13122
dc.identifier.urihttps://hdl.handle.net/20.500.14034/704
dc.identifier.wosWOS:000836939600001en_US
dc.identifier.wosqualityQ2en_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.language.isoenen_US
dc.publisherWileyen_US
dc.relation.journalExpert Systemsen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectanomaly detectionen_US
dc.subjectartificial intelligenceen_US
dc.subjectautomotive industryen_US
dc.subjectdata miningen_US
dc.subjectensemble learningen_US
dc.subjectrare itemset miningen_US
dc.subjectAssociation Ruleen_US
dc.subjectDowntimeen_US
dc.titleERIM: An ensemble of rare itemset mining and its application in the automotive industryen_US
dc.typeArticleen_US

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
pelin yıldırım.pdf
Boyut:
1.36 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Tam metin / Full text