STATISTICS:
What have I learned from my data set?
And could I have been more clever in
producing/analyzing it?
GK-Series by Ln(a) Sellentin
By the end of this lecture series, we'll have...
● Seen the difference between Frequentist and Bayesian statistics
● Assessed how many parameters a data set supports
● Contrasted model selection with “improving parameter constraints”
● Seen how to approximate likelihoods/posteriors, and worried about Gaussianity
Reminder:
When we were young...
The beginners' lab course (Anfängerpraktikum)...
Task: Measure the current x times. What is the “true” current?
The typical procedure was:
● Measure what you want to know, e.g. the current I.
● Measure it often.
● Calculate a mean and a standard deviation.
● Measure U often, and calculate its mean and standard deviation.
● Use U = RI to get R, using Gaussian error propagation.
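The procedure above can be sketched in a few lines. This is a minimal illustration, not the lab course's prescribed method: the readings are invented, the helper names are hypothetical, the errors on U and I are assumed to be independent, and the standard error of the mean is assumed to be the uncertainty that gets propagated.

```python
import numpy as np

def mean_and_std_error(samples):
    """Return the sample mean and the standard error of that mean."""
    samples = np.asarray(samples, dtype=float)
    mean = samples.mean()
    # ddof=1 gives the unbiased sample standard deviation
    std_error = samples.std(ddof=1) / np.sqrt(samples.size)
    return mean, std_error

def resistance_with_error(u_samples, i_samples):
    """Estimate R = U / I with first-order Gaussian error propagation.

    Assumption: the errors on U and I are independent.
    """
    u_mean, u_err = mean_and_std_error(u_samples)
    i_mean, i_err = mean_and_std_error(i_samples)
    r = u_mean / i_mean
    # For a quotient, the relative errors add in quadrature
    r_err = abs(r) * np.sqrt((u_err / u_mean) ** 2 + (i_err / i_mean) ** 2)
    return r, r_err

# Hypothetical readings (volts and amperes), invented for illustration
u_readings = [4.9, 5.1, 5.0, 5.2, 4.8]
i_readings = [0.98, 1.02, 1.00, 1.01, 0.99]

r, r_err = resistance_with_error(u_readings, i_readings)
print(f"R = {r:.3f} +/- {r_err:.3f} ohm")
```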
This was low-level (frequentist) statistics.
Low-level because...
● What you measured was already the variable you wanted to know.
● Your “estimator” is your measurement point.
● Usually, you have to estimate a parameter.
● You need a set of multiple data points x_i(p,q,r) to estimate the parameters.
→ You need multiple sets of data points to know how the estimates scatter.
→ Don't confuse the number and variance of data points with the number and variance of estimates of your parameters.
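The distinction between the scatter of data points and the scatter of estimates can be made concrete with a small simulation. This is only a sketch under assumed conditions: Gaussian noise, a single parameter (the mean), and made-up values for the true mean, the noise level, and the sizes of the data sets.

```python
import numpy as np

rng = np.random.default_rng(seed=0)

true_mean = 2.0   # hypothetical "true" parameter
sigma = 1.0       # hypothetical noise of a single data point
n_points = 100    # data points per data set
n_sets = 2000     # number of independent data sets

# Each row is one data set; each data set yields ONE estimate of the parameter
data = rng.normal(true_mean, sigma, size=(n_sets, n_points))
estimates = data.mean(axis=1)

# Spread of the data points inside a single data set
scatter_of_data = data[0].std(ddof=1)
# Spread of the parameter estimates across the data sets
scatter_of_estimates = estimates.std(ddof=1)

print(f"data points per set: {n_points}, scatter: {scatter_of_data:.3f}")
print(f"estimates:           {n_sets}, scatter: {scatter_of_estimates:.3f}")
```

In this setup the data points scatter with roughly sigma, while the estimates scatter with roughly sigma/sqrt(n_points); the two numbers answer different questions.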
In the practical courses, this issue did not appear (due to the
given tasks).
Once, we had to conduct a fit:
● Minimize the χ² (the sum of squared, error-weighted residuals),
● which is a consequence of maximizing the (Gaussian) likelihood.
→ What you are used to from your studies is Frequentist statistics.
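The link between such a fit and the likelihood can be checked numerically. The sketch below assumes a straight-line model, independent Gaussian errors with known sigma, and simulated data; none of these specifics come from the lecture. Under those assumptions ln L = -χ²/2 + const, so the parameters that minimize χ² also maximize the likelihood.

```python
import numpy as np

def chi_square(params, x, y, sigma):
    """Sum of squared, error-weighted residuals for a straight line."""
    slope, intercept = params
    residuals = (y - (slope * x + intercept)) / sigma
    return np.sum(residuals ** 2)

def gaussian_log_likelihood(params, x, y, sigma):
    """ln L for independent Gaussian errors: -chi^2/2 minus a constant."""
    norm = np.sum(np.log(sigma * np.sqrt(2.0 * np.pi)))
    return -0.5 * chi_square(params, x, y, sigma) - norm

# Simulated data with hypothetical true values slope=1.5, intercept=0.7
rng = np.random.default_rng(seed=1)
x = np.linspace(0.0, 10.0, 20)
sigma = np.full_like(x, 0.5)
y = 1.5 * x + 0.7 + rng.normal(0.0, sigma)

# Weighted linear least squares minimizes chi^2 in closed form
design = np.column_stack([x, np.ones_like(x)]) / sigma[:, None]
best_fit, *_ = np.linalg.lstsq(design, y / sigma, rcond=None)

# Any other parameter choice has a larger chi^2 and a smaller likelihood
shifted = best_fit + np.array([0.05, -0.1])
for label, p in (("best fit", best_fit), ("shifted", shifted)):
    print(label, chi_square(p, x, y, sigma),
          gaussian_log_likelihood(p, x, y, sigma))
```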
The common aim of Frequentists and Bayesians
● Both assume there exists a “true” physical quantity, which we want to know.
● Both have to infer it from imperfect data.
● Imperfections: the data set is finite, noisy, potentially biased.
● Both of these statistical approaches try to split imperfections from the true underlying physics.
● For perfect data, the two approaches would lead to identical results.
An (overly) simplistic view:
● Frequentism is good if uncertainties in the measurements dominate (these can be brought down by continuing to measure, as 1/sqrt(N)...).
● Bayesianism is good when uncertainties in the data interpretation dominate.
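The 1/sqrt(N) scaling mentioned above translates directly into a cost estimate. The helper below is a hypothetical illustration; it assumes independent measurements that all share the same single-measurement uncertainty, and it covers only the statistical part of the error.

```python
import math

def measurements_needed(sigma_single, target_error):
    """Smallest N such that sigma_single / sqrt(N) <= target_error."""
    if sigma_single <= 0 or target_error <= 0:
        raise ValueError("both uncertainties must be positive")
    return math.ceil((sigma_single / target_error) ** 2)

# Hypothetical single-measurement uncertainty of 1.0 (arbitrary units)
for target in (0.5, 0.25, 0.125):
    n = measurements_needed(1.0, target)
    print(f"target error {target}: N = {n}")
```

Each halving of the target error quadruples the number of measurements, which is why measuring longer helps only as long as measurement noise is the dominant uncertainty.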
LECTURE 1
● Some cross-calibrations between particle physics and cosmology
● Especially: Why Bayesian statistics
Why Bayesian statistics?
A typical life cycle in astronomy...
Let's build an X-ray satellite
Building: Mirror shells
and coating with iridium
Image Credit: NASA/Chandra/Raytheon Company
Assembling
Image Credit: NASA/Chandra/Eastman-Kodak
Testing: traditional Frequentist's playground
Image Credit: Nasa/Chandra
Testing under mimicked space conditions
Image Credit: NASA/Chandra/NGST
Image Credit: NASA
Image Credit: NASA
Tests, tests, tests!
On Orbit Calibration
Tests, tests, tests!
First Light Image: Public Release
Image Credit: NASA/Chandra/SAO
Illustration: NASA/CXC/D.Berry & M.Weiss
