Scalable inference; statistical, algorithmic, computational aspects

Cambridge University

36 Episodes

The complexity and sheer size of modern data sets, of which ever increasingly demanding questions are posed, give rise to major challenges and opportunities for modern statistics. While likelihood-based statistical methods still provide the gold standard for statistical methodology, the applicability of existing likelihood methods to the most demanding of modern problems is currently limited. Thus traditional methodologies for numerical optimisation of likelihoods, and for simulating from complicated posterior distributions, such as Markov chain Monte Carlo and Sequential Monte Carlo algorithms often scale poorly with data size and model complexity, and thus fail for the most complex of modern problems.

The area of computational statistics is currently developing extremely rapidly, motivated by the challenges of the recent big data revolution, and enriched by new ideas from machine learning, multi-processor computing, probability and applied mathematical analysis. Motivation for this development comes from across the physical biological and social sciences, including physics, chemistry, astronomy, epidemiology, medicine, genetics, sociology, economics - in fact it is hard to find problems not enriched by big data and the resultant associated statistical challenges.

This programme will focus on methods associated with likelihood, its variants and approximations, taking advantage of, and creating new advances in statistical methodology. These advances have the potential to impact on all aspects of science and industry that rely on probabilistic models for learning from observational or experimental data.

Intractable likelihood problems are defined loosely as ones where the repeated evaluation of likelihood function (as required in standard algorithms for likelihood-based inference) is impossible or too computationally expensive to carry out. Scalable methods for carrying out statistical inference are loosely defined to be methods whose computational cost and statistical validity scale well with both model complexity and data size.

Understanding and developing scalable methods for intractable likelihood problems requires expertise across statistics, computer science, probability and numerical analysis. Thus it is imperative that the programme be broad, covering statistical, algorithmic and computational aspects of inference. The programme will cut across the traditional boundary between frequentist and Bayesian inference, and will incorporate both statistics and machine learning approaches to inference. Central to the focus will be the close integration of algorithm optimisation with the opportunities offered, and constraints imposed by modern multi-core technologies such as GPUs.

The first week of the programme will feature a broad-focused workshop, and more application specific activities will take place later.

Podcasts Similar to Scalable inference; statistical, algorithmic, computational aspects

Statistical scalability (98.18%)

Cambridge University

Signal Integrity (96.87%)

Colin Warwick, Agilent EEsof EDA

Flush to Data (96.79%)

Kris Villez and Jörg Rieckermann

Data Skeptic (96.37%)

Kyle Polich

Your Data Teacher Podcast (96.23%)

Your Data Teacher

Stanford MLSys Seminar (96.17%)

Dan Fu, Karan Goel, Fiodar Kazhamakia, Piero Molino, Matei Zaharia, Chris Ré

Within & Between (95.96%)

Within&Between Podcast

Machine Learning with Coffee (95.53%)

Gustavo Lujan

Department of Statistics (95.31%)

Oxford University

Data & Probability (95.31%)

None

Significant Statistics (95.23%)

John Russell

intuitions behind Data Science (95.14%)

Ashay Javadekar

The Analytical Wavelength (95.05%)

ACD/Labs

learning methods (94.85%)

Marihely Martínez

EAGE E-Lecture Series (94.77%)

Yury Petrachenko

Linear Digressions (94.63%)

Ben Jaffe and Katie Malone

A Propensity to Talk Density (94.46%)

Bell Geospace

Machine-Centric Science (94.46%)

Donny Winston

Data Science at Home (94.42%)

Francesco Gadaleta

People Analytics Deconstructed (94.4%)

Millan Chicago

Counting Sand (94.27%)

Angelo Kastroulis

DataCafé (94.25%)

Jason & Jeremy

Practical AI: Machine Learning, Data Science (94.23%)

Changelog Media

Earth Observation (94.13%)

mapscaping.com

Raising Heretics, the Podcast (94.07%)

Dr Linda McIver

Advanced Monte Carlo Methods for Complex Inference Problems (94.01%)

Cambridge University

White Privilege (93.93%)

Shaniya Trotter

Geospatial Concepts (93.86%)

mapscaping.com

Type Cast Heroes (93.81%)

Type Cast Heroes

the bioinformatics chat (93.79%)

Roman Cheplyaka

The Mathematics of Machine Learning - A Research Conference of the Cantab Capital Institute for the Mathematics of Information (93.77%)

Cambridge University

DataBytes (93.76%)

Jessi & Susan

Oil & Gas Measurement Podcast (93.76%)

Weldon Wright

Statistics for the Social Sciences (93.72%)

Dr. Brad R. Fulton

APC White Paper Podcast (93.7%)

American Power Conversion

An introduction to biological systematics - for iBooks (93.67%)

The Open University

No Bias (93.61%)

No Bias

Towards Data Science (93.61%)

The TDS team

Statistics (93.59%)

Susan Elizabeth Cooper-Nguyen

Austrian Ai Podcast (93.56%)

Manuel Pasieka

The Science Fair Podcast (93.56%)

sciencenugget.com

Model-Data Integration in Physical Systems (93.55%)

Cambridge University

The MapScaping Podcast - GIS, Geospatial, Remote Sensing, earth observation and digital geography (93.54%)

MapScaping

Professional Development Mirror (93.53%)

Itnesh_Data Science Enthusiasts

Deep Sky (93.52%)

Samuel Chandra

Research Methodology (93.48%)

Firos Khan

Thales Sehn Körting (93.47%)

Thales

Brief Overview of Data Analytics and Vit Tall LLC (93.46%)

Vit Tall LLC

Understanding Multi-Modal Data for Social and Human Behaviour (93.45%)

Cambridge University

Pondering AI (93.39%)

Kimberly Nevala, Strategic Advisor - SAS

The SofyanMarkarma Program (93.36%)

Sofyan Sofyan

Introduction to Machine Learning (93.35%)

edureka!

Vanishing Gradients (93.35%)

Hugo Bowne-Anderson

Core Dump (93.34%)

Luís Marques, Rita Morais

Quantitude (93.3%)

Greg Hancock & Patrick Curran

The Founder's Series (93.28%)

Impli Limited

Humans, Data and Machines (93.28%)

None

Disseminate: The Computer Science Research Podcast (93.24%)

JACK WAUDBY

Essay4Students (93.18%)

None

Beneath the Subsurface (93.15%)

TGS

The Mathematics of Deep Learning and Data Science (92.99%)

Cambridge University

MIS Grade 9 Haley MYP & IGCSE Math (92.99%)

Carl Haley

Dependent Variable Podcast (92.98%)

Founder360

Machine Learning Simplified (92.96%)

Priyanka Sharma

The Measurement Minute (92.96%)

Gary Angel

Talking Statistics (92.93%)

Mohammad Nasir Abdullah

Data Reflections (92.92%)

WSWHE BOCES Data Analysis Service

Satellite Superheroes (92.9%)

Scott MacKenzie

Math Analysis (92.87%)

None

The Machine Learning Podcast (92.83%)

Tobias Macey

Functional Design in Clojure (92.82%)

Christoph Neumann and Nate Jones

AH Fizzics (92.8%)

Sinclair Mackenzie

The Data Analysis Bureau Podcast (92.73%)

The Data Analysis Bureau

Programming (92.73%)

Minko Gechev

StreamNative's Podcast (92.72%)

StreamNative

AI Live & Unbiased (92.7%)

Dr. Jerry Smith

Zambezi Observations (92.67%)

Zambezi Capital

Big Data and the Role of Statistical Scalability (92.66%)

Cambridge University

R for the Rest of Us Podcast (92.66%)

David Keyes

Women in Data Science (92.53%)

Professor Margot Gerritsen, Cindy Orozco Bohorquez

MEASURE Evaluation (92.53%)

MEASURE Evaluation

Adventures in Machine Learning (92.52%)

Top End Devs

The Play Leadership Coaching (92.48%)

Ky Diep

Statistical Scalability for Streaming Data (92.48%)

Cambridge University

Learning Better and Faster (92.48%)

MLearning.ai

Linear Digressions (92.42%)

None

QuantumBlack Voices (92.41%)

QuantumBlack

Artificiality (92.37%)

Sonder Studio

Newton Gateway to Mathematics (92.36%)

Cambridge University

AI&U - Sharad Gandhi and Christian Ehl (92.36%)

AI&U - Sharad Gandhi and Christian Ehl

Your AI Injection (92.36%)

Deep

Distributed Data Management (WT 2019/20) - tele-TASK (92.36%)

Dr. Thorsten Papenbrock

GeocHemiSTea (92.36%)

Sam Scher

Maths Around You (92.35%)

Maths Podcasts

Ken's Nearest Neighbors (92.35%)

Ken Jee

Data Lit (92.34%)

Data, Research, and Accountability

Profit is a Science (92.33%)

David Primer

O'Reilly Data Show Podcast (92.29%)

O'Reilly Media

Data Stories (92.26%)

Enrico Bertini and Moritz Stefaner

Women in Analytics After Hours (92.25%)

Women in Analytics