Machine learning: Trends, perspectives, and prospects
MI Jordan, TM Mitchell - Science, 2015 - science.org
Machine learning addresses the question of how to build computers that improve
automatically through experience. It is one of today's most rapidly growing technical fields …
automatically through experience. It is one of today's most rapidly growing technical fields …
A random forest guided tour
The random forest algorithm, proposed by L. Breiman in 2001, has been extremely
successful as a general-purpose classification and regression method. The approach, which …
successful as a general-purpose classification and regression method. The approach, which …
Communication-efficient distributed statistical inference
We present a communication-efficient surrogate likelihood (CSL) framework for solving
distributed statistical inference problems. CSL provides a communication-efficient surrogate …
distributed statistical inference problems. CSL provides a communication-efficient surrogate …
Big data and data science methods for management research
The recent advent of remote sensing, mobile technologies, novel transaction systems, and
highperformance computing offers opportunities to understand trends, behaviors, and …
highperformance computing offers opportunities to understand trends, behaviors, and …
Deep exploration via randomized value functions
We study the use of randomized value functions to guide deep exploration in reinforcement
learning. This offers an elegant means for synthesizing statistically and computationally …
learning. This offers an elegant means for synthesizing statistically and computationally …
Big data and management
Editor's note: This editorial launches a series written by editors and co-authored with a
senior executive, thought leader, or scholar from a different field to explore new content …
senior executive, thought leader, or scholar from a different field to explore new content …
Random forests for big data
Big Data is one of the major challenges of statistical science and has numerous
consequences from algorithmic and theoretical viewpoints. Big Data always involve massive …
consequences from algorithmic and theoretical viewpoints. Big Data always involve massive …
Bayes and big data: The consensus Monte Carlo algorithm
A useful definition of 'big data'is data that is too big to process comfortably on a single
machine, either because of processor, memory, or disk bottlenecks. Graphics processing …
machine, either because of processor, memory, or disk bottlenecks. Graphics processing …
[HTML][HTML] Distributed testing and estimation under sparse high dimensional models
This paper studies hypothesis testing and parameter estimation in the context of the divide-
and-conquer algorithm. In a unified likelihood based framework, we propose new test …
and-conquer algorithm. In a unified likelihood based framework, we propose new test …