Skip to main content
Cornell University

arXiv submission will be down for maintenance beginning 14:00 EDT Tuesday June 30th. The site should otherwise remain in operation.

Learn about arXiv becoming an independent nonprofit.
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > math > arXiv:2606.24766

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Mathematics > Statistics Theory

arXiv:2606.24766 (math)
[Submitted on 23 Jun 2026]

Title:A Concentration Inequality for the Covariance Matrix of an Arbitrary Subset of Random Vectors

Authors:Huikang Liu, Peng Wang, Laura Balzano
View a PDF of the paper titled A Concentration Inequality for the Covariance Matrix of an Arbitrary Subset of Random Vectors, by Huikang Liu and 2 other authors
View PDF HTML (experimental)
Abstract:Concentration inequalities for sample covariance matrices are fundamental tools in high-dimensional probability. Classical results typically assume that the selected random vectors are independent of the selection rule. In this paper, we study spectral concentration for sample covariance matrices formed from arbitrary, possibly data-dependent subsets of i.i.d. random vectors. Such data-dependent selection destroys the usual independence structure and makes standard covariance concentration bounds inapplicable. For i.i.d. Gaussian random vectors, we prove high-probability lower and upper bounds for the minimal and maximal eigenvalues of such selected covariance matrices. Compared with a direct union-bound argument, our results provide substantially sharper guarantees and allow much smaller subset proportions. We further discuss extensions from Gaussian to sub-Gaussian random vectors, and beyond independence to weakly dependent observations, with geometrically strong-mixing Gaussian sequences serving as a representative example of the latter. Finally, we apply the developed concentration inequalities to the K-subspace clustering problem under a low-rank Gaussian mixture model, where the optimal clusters are inherently data-dependent. Our results yield recovery guarantees showing that the clustering error of global minimizers decays polynomially with the signal-to-noise ratio.
Comments: 29 pages, 2 figures, 1 table
Subjects: Statistics Theory (math.ST); Optimization and Control (math.OC)
Cite as: arXiv:2606.24766 [math.ST]
  (or arXiv:2606.24766v1 [math.ST] for this version)
  https://doi.org/10.48550/arXiv.2606.24766
arXiv-issued DOI via DataCite

Submission history

From: Peng Wang [view email]
[v1] Tue, 23 Jun 2026 16:26:55 UTC (81 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled A Concentration Inequality for the Covariance Matrix of an Arbitrary Subset of Random Vectors, by Huikang Liu and 2 other authors
  • View PDF
  • HTML (experimental)
  • TeX Source
view license

Current browse context:

math.ST
< prev   |   next >
new | recent | 2026-06
Change to browse by:
math
math.OC
stat
stat.TH

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
Loading...

BibTeX formatted citation

Data provided by:

Bookmark

BibSonomy Reddit

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status