Showing 1–5 of 5 results for author: Rhodes, J S

  1. arXiv:2410.04312  [pdf, other]

    stat.ME

    Adjusting for Spatial Correlation in Machine and Deep Learning

    Authors: Matthew J. Heaton, Andrew Millane, Jake S. Rhodes

    Abstract: Spatial data display correlation between observations collected at neighboring locations. Generally, machine and deep learning methods either do not account for this correlation or do so indirectly through correlated features and thereby forfeit predictive accuracy. To remedy this shortcoming, we propose preprocessing the data using a spatial decorrelation transform derived from properties of a mu…

    Submitted 5 October, 2024; originally announced October 2024.

  2. arXiv:2406.04421  [pdf, other]

    cs.LG stat.ML

    Enhancing Supervised Visualization through Autoencoder and Random Forest Proximities for Out-of-Sample Extension

    Authors: Shuang Ni, Adrien Aumon, Guy Wolf, Kevin R. Moon, Jake S. Rhodes

    Abstract: The value of supervised dimensionality reduction lies in its ability to uncover meaningful connections between data features and labels. Common dimensionality reduction methods embed a set of fixed, latent points, but are not capable of generalizing to an unseen test set. In this paper, we provide an out-of-sample extension method for the random forest-based supervised dimensionality reduction met…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 7 pages, 3 figures

  3. arXiv:2307.01077  [pdf, other]

    stat.ML cs.LG

    Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities

    Authors: Jake S. Rhodes

    Abstract: Manifold learning approaches seek the intrinsic, low-dimensional data structure within a high-dimensional space. Mainstream manifold learning algorithms, such as Isomap, UMAP, $t$-SNE, Diffusion Map, and Laplacian Eigenmaps do not use data labels and are thus considered unsupervised. Existing supervised extensions of these methods are limited to classification problems and fall short of uncovering…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 10 pages

  4. arXiv:2201.12682  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Geometry- and Accuracy-Preserving Random Forest Proximities

    Authors: Jake S. Rhodes, Adele Cutler, Kevin R. Moon

    Abstract: Random forests are considered one of the best out-of-the-box classification and regression algorithms due to their high level of predictive performance with relatively little tuning. Pairwise proximities can be computed from a trained random forest and measure the similarity between data points relative to the supervised task. Random forest proximities have been used in many applications including…

    Submitted 28 February, 2023; v1 submitted 29 January, 2022; originally announced January 2022.

  5. arXiv:2006.08701  [pdf, other]

    stat.ML cs.HC cs.LG stat.AP

    Supervised Visualization for Data Exploration

    Authors: Jake S. Rhodes, Adele Cutler, Guy Wolf, Kevin R. Moon

    Abstract: Dimensionality reduction is often used as an initial step in data exploration, either as preprocessing for classification or regression or for visualization. Most dimensionality reduction techniques to date are unsupervised; they do not take class labels into account (e.g., PCA, MDS, t-SNE, Isomap). Such methods require large amounts of data and are often sensitive to noise that may obfuscate impo…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 21 pages, 9 figures