Impact Factor:6.549
 Scopus Suggested Journal: Tracking ID for this title suggestion is: 55EC484EE39417F0

International Journal
of Computer Engineering in Research Trends (IJCERT)

Scholarly, Peer-Reviewed, Platinum Open Access and Multidisciplinary




Welcome to IJCERT

International Journal of Computer Engineering in Research Trends. Scholarly, Peer-Reviewed, Platinum Open Access and Multidisciplinary

ISSN(Online):2349-7084                 Submit Paper    Check Paper Status    Conference Proposal

Back to Current Issues

Classification of Concept Drifting Data Streams Using Adaptive Novel-Class Detection

Ms. Aparna Yeshwantrao Ladekar, Dr. M.Y. Joshi, , ,
Affiliations
MGM’s College of Engineering, SRTMUN University, Nanded
:10.22362/ijcert/2016/v3/i9/48901


Abstract
In data stream classification there are many problems observed by the data mining community. Four major problems are addressed, such as, concept-drift, infinite length, feature-evolution and concept-evolution. Concept-drift occurs when underlying concept changes which is common in data streams. Practically it is not possible to store and use all data for training purpose whenever required due to infinite length of data streams. Feature evolution frequently occurs in many text streams. In text streams new features like words or phrases may occur when stream progresses. New classes evolving in the data stream which occurs concept-evolution as a result. Most existing classification techniques of data stream consider only the first two challenges, and ignore the latter two. Classification of concept-drifting data stream using adaptive novel-class detection approach is used to solve concept-drift and concept-evolution problem where novel-class detector is maintained with classifier. Novel-class detector is more adaptive to the dynamic and evolving data streams. It enables to detect more than one novel-class simultaneously. This approach solves feature-evolution problem by using feature set homogenization technique. Experiments done on Twitter data set and got reduced ERR rate and increased detection rate as a result. This approach is very effective as compared with existing data stream classification techniques


Citation
Aparna Yeshwantrao Ladekar et.al ," Classification of Concept Drifting Data Streams Using Adaptive Novel-Class Detection”, International Journal of Computer Engineering In Research Trends, Volume 3, Issue 9, September-2016, pp. 514-520


Keywords : — Concept-drift, concept-evolution, data streams, novel-class, outlier

References
[1] Mohammad M. Masud, Member, IEEE, Qing Chen, Member, IEEE,
Latifur Khan, Senior Member, IEEE, Charu C. Aggarwal, Fellow,
IEEE, Jing Gao, Member, IEEE, Jiawei Han, Fellow, IEEE, Ashok
Srivastava, Senior Member, IEEE, and Nikunj C. Oza, Member,
IEEE,” Classification and Adaptive Novel-class Detection of Feature-Evolving
Data Streams,” IEEE Transactions on Knowledge
and Data Engineering, vol. 25, no. 7, July 2013.
[2] M.M. Masud, Q. Chen, J. Gao, L. Khan, J. Han, and B.M.
Thuraisingham, “Classification and Novel-class Detection of Data
Streams in a Dynamic Feature Space,” Proc. European
Conf.Machine Learning and Knowledge Discovery in Databases
(ECML PKDD), pp. 337-352, 2010.
[3] M.M. Masud, J. Gao, L. Khan, J. Han, and B.M. Thuraisingham,
“Integrating Novel-class Detection with Classification for ConceptDrifting
Data Streams,” Proc. European Conf. Machine Learning
and Knowledge Discovery in Databases (ECML PKDD), pp. 79-94,
2009.
[4] A. Bifet and R. Kirkby. Data stream mining − a practical approach.
 http://moa.cs.waikato.ac.nz/downloads/.
[5] M.M. Masud, Q. Chen, L. Khan, C. Aggarwal, J. Gao, J. Han, and
B.M. Thuraisingham, “Addressing Concept-Evolution in ConceptDrifting
Data Streams,” Proc. IEEE Int’l Conf. Data Mining (ICDM),
pp. 929-934, 2010.
[6] G. Hulten, L. Spencer, and P. Domingos, “Mining Time-Changing
Data Streams,” Proc. ACM SIGKDD Seventh Int’l Conf. Knowledge
Discovery and Data Mining, pp. 97-106, 2001.
[7] Christopher D. Manning, Prabhakar Raghavan & Hinrich Schütz,
“Introduction to Information Retrieval,” e, 2008.
[8] “Stemming”, http://en.wikipedia.org/wiki/Stemming.
[9] M.F.Porter, “An algorithm for suffix stripping,” Computer Laboratory,
Cambridge.
[10] E.J.Spinosa, A.P. de Leon F. de Carvalho, and J. Gama, “ClusterBased
Novel Concept Detection in Data Streams Applied to Intrusion
Detection in Computer Networks,”Proc. ACM Symp. Applied
Computing (SAC), pp. 976-980, 2008.
[11] I. Katakis, G. Tsoumakas, and I. Vlahavas, “Dynamic Feature Space
and Incremental Feature Selection for the Classification of Textual
Data Streams, ” Proc. IntlWorkshop Knowledge Discovery from
Data Streams (ECML/PKDD), pp. 102-116, 2006.
[12] M.M. Masud, J. Gao, L. Khan, J. Han, and B.M. Thuraisingham,
“Classification and Novel-class Detection in Concept-Drifting Data
Streams under Time Constraints,” IEEE Trans. Knowledge and Data
Eng., vol. 23, no. 6, pp. 859-874, June 2011.
[13] B.Wenerstrom and C.Giraud-Carrier, “Temporal Data Mining in
Dynamic Feature Spaces,” Proc. Sixth Int’l Conf. Data Mining
(ICDM), pp. 1141-1145, 2006.
[14] W. Fan, “Systematic Data Selection to Mine Concept-Drifting Data
Streams,” Proc. ACM SIGKDD 10th Int’l Conf. Knowledge Discovery
and Data Mining, pp. 128-137, 2004. 


DOI Link : http://www.dx.doi.org/10.22362/ijcert/2016/v3/i9/48901

Download :
  V3I910.pdf


Refbacks : There are no ref

Support Us


We have kept IJCERT is a free peer-reviewed scientific journal to endorse conservation. We have not put up a paywall to readers, and we do not charge for publishing. But running a monthly journal costs is a lot. While we do have some associates, we still need support to keep the journal flourishing. If our readers help fund it, our future will be more secure.

Quick Links



DOI:10.22362/ijcert


Science Central

Score: 13.30





Submit your paper to editorijcert@gmail.com