Machine learning: Trends, perspectives, and prospects

MI Jordan, TM Mitchell - Science, 2015 - science.org
Machine learning addresses the question of how to build computers that improve
automatically through experience. It is one of today's most rapidly growing technical fields …

A random forest guided tour

G Biau, E Scornet - Test, 2016 - Springer
The random forest algorithm, proposed by L. Breiman in 2001, has been extremely
successful as a general-purpose classification and regression method. The approach, which …

Communication-efficient distributed statistical inference

MI Jordan, JD Lee, Y Yang - Journal of the American Statistical …, 2019 - Taylor & Francis
We present a communication-efficient surrogate likelihood (CSL) framework for solving
distributed statistical inference problems. CSL provides a communication-efficient surrogate …

Big data and data science methods for management research

G George, EC Osinga, D Lavie… - Academy of Management …, 2016 - journals.aom.org
The recent advent of remote sensing, mobile technologies, novel transaction systems, and
highperformance computing offers opportunities to understand trends, behaviors, and …

Deep exploration via randomized value functions

I Osband, B Van Roy, DJ Russo, Z Wen - Journal of Machine Learning …, 2019 - jmlr.org
We study the use of randomized value functions to guide deep exploration in reinforcement
learning. This offers an elegant means for synthesizing statistically and computationally …

Big data and management

G George, MR Haas, A Pentland - Academy of management …, 2014 - journals.aom.org
Editor's note: This editorial launches a series written by editors and co-authored with a
senior executive, thought leader, or scholar from a different field to explore new content …

Consistency of random forests

E Scornet, G Biau, JP Vert - 2015 - projecteuclid.org
Consistency of random forests Page 1 The Annals of Statistics 2015, Vol. 43, No. 4, 1716–1741
DOI: 10.1214/15-AOS1321 © Institute of Mathematical Statistics, 2015 CONSISTENCY OF …

Random forests for big data

R Genuer, JM Poggi, C Tuleau-Malot… - Big Data Research, 2017 - Elsevier
Big Data is one of the major challenges of statistical science and has numerous
consequences from algorithmic and theoretical viewpoints. Big Data always involve massive …

Bayes and big data: The consensus Monte Carlo algorithm

SL Scott, AW Blocker, FV Bonassi… - Big Data and …, 2022 - taylorfrancis.com
A useful definition of 'big data'is data that is too big to process comfortably on a single
machine, either because of processor, memory, or disk bottlenecks. Graphics processing …

[HTML][HTML] Distributed testing and estimation under sparse high dimensional models

H Battey, J Fan, H Liu, J Lu, Z Zhu - Annals of statistics, 2018 - ncbi.nlm.nih.gov
This paper studies hypothesis testing and parameter estimation in the context of the divide-
and-conquer algorithm. In a unified likelihood based framework, we propose new test …