Not logged in.

Contribution Details

Type Working Paper
Scope Discipline-based scholarship
Title Inference for dependent data with learned clusters
Organization Unit
  • Jianfei Cao
  • Christian Hansen
  • Damian Kozbur
  • Lucciano Villacorta
  • English
Institution Cornell University
Series Name
Number 2107.14677
ISSN 2331-8422
Number of Pages 37
Date 2021
Abstract Text This paper presents and analyzes an approach to cluster-based inference for dependent data. The primary setting considered here is with spatially indexed data in which the dependence structure of observed random variables is characterized by a known, observed dissimilarity measure over spatial indices. Observations are partitioned into clusters with the use of an unsupervised clustering algorithm applied to the dissimilarity measure. Once the partition into clusters is learned, a cluster-based inference procedure is applied to a statistical hypothesis testing procedure. The procedure proposed in the paper allows the number of clusters to depend on the data, which gives researchers a principled method for choosing an appropriate clustering level. The paper gives conditions under which the proposed procedure asymptotically attains correct size. A simulation study shows that the proposed procedure attains near nominal size in finite samples in a variety of statistical testing problems with dependent data.
Free access at Official URL
Official URL
PDF File Download from ZORA
Export BibTeX
Keywords Unsupervised learning, cluster-based Inference, HAR inference