RAPTT: An Exact Two-Sample Test in High Dimensions Using Random Projections

In high dimensions, the classical Hotelling’s T2 test tends to have low power or becomes undefined due to singularity of the sample covariance matrix. In this article, this problem is overcome by projecting the data matrix onto lower dimensional subspaces through multiplication by random matrices. We propose RAPTT (RAndom Projection T2-Test), an exact test for equality of means of two normal populations based on projected lower dimensional data. RAPTT does not require any constraints on the dimension of the data or the sample size. A simulation study indicates that in high dimensions the power of this test is often greater than that of competing tests. The advantages of RAPTT are illustrated on a high-dimensional gene expression dataset involving the discrimination of tumor and normal colon tissues.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.6084/m9.figshare.1486462.v1
PID https://www.doi.org/10.6084/m9.figshare.1486462
URL https://figshare.com/articles/RAPTT_An_Exact_Two_Sample_Test_in_High_Dimensions_Using_Random_Projections/1486462
URL https://dx.doi.org/10.6084/m9.figshare.1486462.v1
URL http://dx.doi.org/10.6084/m9.figshare.1486462
URL https://dx.doi.org/10.6084/M9.FIGSHARE.1486462
URL http://dx.doi.org/10.6084/m9.figshare.1486462.v1
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Radhendushka Srivastava
Author Li, Ping
Author Ruppert, David
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From figshare; Datacite
Hosted By figshare
Publication Date 2015-01-01
Publisher Figshare
Additional Info
Field Value
Language UNKNOWN
Resource Type Dataset
keyword FOS: Mathematics
keyword FOS: Health sciences
keyword Large p, small n
keyword FOS: Biological sciences
system:type dataset
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/dataset?datasetId=dedup_wf_001::2f29626d27c7c5bbe0edeef70ea90ae7
Author jsonws_user
Last Updated 3 January 2021, 06:20 (CET)
Created 3 January 2021, 06:20 (CET)