Skip to main content

Nonparametric Detection of Anomalous Data Streams

Author(s): Zou, Shaofeng; Liang, Yingbin; Poor, H Vincent; Shi, Xinghua

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1bg2h970
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZou, Shaofeng-
dc.contributor.authorLiang, Yingbin-
dc.contributor.authorPoor, H Vincent-
dc.contributor.authorShi, Xinghua-
dc.date.accessioned2024-02-18T03:26:52Z-
dc.date.available2024-02-18T03:26:52Z-
dc.date.issued2017-07-31en_US
dc.identifier.citationZou, Shaofeng, Liang, Yingbin, Poor, H Vincent, Shi, Xinghua. (2017). Nonparametric Detection of Anomalous Data Streams. IEEE Transactions on Signal Processing, 65 (21), 5785 - 5797. doi:10.1109/tsp.2017.2733472en_US
dc.identifier.issn1053-587X-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1bg2h970-
dc.description.abstractA nonparametric anomalous hypothesis testing problem is investigated, in which there are totally n observed sequences out of which s anomalous sequences are to be detected. Each typical sequence consists of m independent and identically distributed (i.i.d.) samples drawn from a distribution p, whereas each anomalous sequence consists of mi.i.d. samples drawn from a distribution q that is distinct from p. The distributions p and q are assumed to be unknown in advance. Distribution-free tests are constructed by using the maximum mean discrepancy as the metric, which is based on mean embeddings of distributions into a reproducing kernel Hilbert space. The probability of error is bounded as a function of the sample size m, the number s of anomalous sequences, and the number n of sequences. It is shown that with s known, the constructed test is exponentially consistent if m is greater than a constant factor of log n, for any p and q, whereas with s unknown, m should have an order strictly greater than log n. Furthermore, it is shown that no test can be consistent for arbitrary p and q if m is less than a constant factor of log n. Thus, the order-level optimality of the proposed test is established. Numerical results are provided to demonstrate that the proposed tests outperform (or perform as well as) tests based on other competitive approaches under various cases.en_US
dc.format.extent5785 - 5797en_US
dc.language.isoen_USen_US
dc.relation.ispartofIEEE Transactions on Signal Processingen_US
dc.rightsAuthor's manuscripten_US
dc.titleNonparametric Detection of Anomalous Data Streamsen_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1109/tsp.2017.2733472-
dc.identifier.eissn1941-0476-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
1405.2294.pdf593.93 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.