Machine learning assisted exploration fo...

A Selective Overview of Variable Selection in High Dimensional Feature Space (Invited Review Article)

October 6, 2009

82% Match

Jianqing Fan, Jinchi Lv

Statistics Theory

High dimensional statistical problems arise from diverse fields of scientific research and technological development. Variable selection plays a pivotal role in contemporary statistical learning and scientific discoveries. The traditional idea of best subset selection methods, which can be regarded as a specific form of penalized likelihood, is computationally too expensive for many modern statistical applications. Other forms of penalized likelihood methods have been success...

Find SimilarView on arXiv

Introduction to Machine Learning for the Sciences

February 8, 2021

82% Match

Titus Neupert, Mark H Fischer, Eliska Greplova, ... , Denner M. Michael

Computational Physics

Disordered Systems and Neura...

Machine Learning

This is an introductory machine-learning course specifically developed with STEM students in mind. Our goal is to provide the interested reader with the basics to employ machine learning in their own projects and to familiarize themself with the terminology as a foundation for further reading of the relevant literature. In these lecture notes, we discuss supervised, unsupervised, and reinforcement learning. The notes start with an exposition of machine learning methods withou...

Find SimilarView on arXiv

Algebraic Ground Truth Inference: Non-Parametric Estimation of Sample Errors by AI Algorithms

June 15, 2020

82% Match

Andrés Corrada-Emmanuel, Edward Pantridge, Edward Zahrebelski, ... , Simeonov Simeon

Machine Learning

Binary classification is widely used in ML production systems. Monitoring classifiers in a constrained event space is well known. However, real world production systems often lack the ground truth these methods require. Privacy concerns may also require that the ground truth needed to evaluate the classifiers cannot be made available. In these autonomous settings, non-parametric estimators of performance are an attractive solution. They do not require theoretical models about...

Find SimilarView on arXiv

Experimentation at the Frontiers of Reality in Schubert Calculus

June 13, 2009

82% Match

Christopher Hillar, Luis Garcia-Puente, Campo Abraham Martin del, James Ruffo, Zach Teitler, ... , Sottile Frank

Algebraic Geometry

We describe a general framework for large-scale computational experiments in mathematics using computer resources that are available in most mathematics departments. This framework was developed for an experiment that is helping to formulate and test conjectures in the real Schubert calculus. Largely using machines in instructional computer labs during off-hours and University breaks, it consumed in excess of 350 GigaHertz-years of computing in its first six months of operati...

Find SimilarView on arXiv

Survey of explainable machine learning with visual and granular methods beyond quasi-explanations

September 22, 2020

82% Match

Boris 2 and 3 Kovalerchuk, Muhammad Aurangzeb 2 and 3 Ahmad, Ankur 2 and 3 Teredesai

Machine Learning

Computer Vision and Pattern ...

This paper surveys visual methods of explainability of Machine Learning (ML) with focus on moving from quasi-explanations that dominate in ML to domain-specific explanation supported by granular visuals. ML interpretation is fundamentally a human activity and visual methods are more readily interpretable. While efficient visual representations of high-dimensional data exist, the loss of interpretable information, occlusion, and clutter continue to be a challenge, which lead t...

Find SimilarView on arXiv

Non-linear Visual Knowledge Discovery with Elliptic Paired Coordinates

July 11, 2021

82% Match

Rose McDonald, Boris Kovalerchuk

Machine Learning

Computer Vision and Pattern ...

Graphics

It is challenging for humans to enable visual knowledge discovery in data with more than 2-3 dimensions with a naked eye. This chapter explores the efficiency of discovering predictive machine learning models interactively using new Elliptic Paired coordinates (EPC) visualizations. It is shown that EPC are capable to visualize multidimensional data and support visual machine learning with preservation of multidimensional information in 2-D. Relative to parallel and radial coo...

Find SimilarView on arXiv

Spurious Vanishing Problem in Approximate Vanishing Ideal

January 25, 2019

82% Match

Hiroshi Kera, Yoshihiko Hasegawa

Machine Learning

Approximate vanishing ideal is a concept from computer algebra that studies the algebraic varieties behind perturbed data points. To capture the nonlinear structure of perturbed points, the introduction of approximation to exact vanishing ideals plays a critical role. However, such an approximation also gives rise to a theoretical problem---the spurious vanishing problem---in the basis construction of approximate vanishing ideals; namely, obtained basis polynomials can be app...

Find SimilarView on arXiv

Learning a performance metric of Buchberger's algorithm

June 7, 2021

82% Match

Jelena Mojsilović, Dylan Peifer, Sonja Petrović

Commutative Algebra

Machine Learning

Symbolic Computation

Algebraic Geometry

Machine Learning

What can be (machine) learned about the complexity of Buchberger's algorithm? Given a system of polynomials, Buchberger's algorithm computes a Gr\"obner basis of the ideal these polynomials generate using an iterative procedure based on multivariate long division. The runtime of each step of the algorithm is typically dominated by a series of polynomial additions, and the total number of these additions is a hardware independent performance metric that is often used to eval...

Find SimilarView on arXiv

Interpretable Machine Learning for Discovery: Statistical Challenges \& Opportunities

August 3, 2023

82% Match

Genevera I. Allen, Luqin Gan, Lili Zheng

Machine Learning

Methodology

New technologies have led to vast troves of large and complex datasets across many scientific domains and industries. People routinely use machine learning techniques to not only process, visualize, and make predictions from this big data, but also to make data-driven discoveries. These discoveries are often made using Interpretable Machine Learning, or machine learning models and techniques that yield human understandable insights. In this paper, we discuss and review the fi...

Find SimilarView on arXiv

Discovering Support and Affiliated Features from Very High Dimensions

June 27, 2012

82% Match

Yiteng Nanyang Technological University Zhai, Mingkui Nanyang Technological University Tan, ... , Ong Yew Soon Nanyang Technological University

Machine Learning

In this paper, a novel learning paradigm is presented to automatically identify groups of informative and correlated features from very high dimensions. Specifically, we explicitly incorporate correlation measures as constraints and then propose an efficient embedded feature selection method using recently developed cutting plane strategy. The benefits of the proposed algorithm are two-folds. First, it can identify the optimal discriminative and uncorrelated feature subset to...

Find SimilarView on arXiv

Machine learning assisted exploration for affine Deligne-Lusztig varieties

A Selective Overview of Variable Selection in High Dimensional Feature Space (Invited Review Article)

Introduction to Machine Learning for the Sciences

Algebraic Ground Truth Inference: Non-Parametric Estimation of Sample Errors by AI Algorithms

Experimentation at the Frontiers of Reality in Schubert Calculus

Survey of explainable machine learning with visual and granular methods beyond quasi-explanations

Non-linear Visual Knowledge Discovery with Elliptic Paired Coordinates

Spurious Vanishing Problem in Approximate Vanishing Ideal

Learning a performance metric of Buchberger's algorithm

Interpretable Machine Learning for Discovery: Statistical Challenges \& Opportunities

Discovering Support and Affiliated Features from Very High Dimensions