HW4 is out! You can already do the first two problems!
HW3 Debrief form is still available until midnight tonight!
HW4 Check-in will be available this weekend. Check in with
your partners!
Recap
Determining the shape of an object while inside it is a non-trivial
task
Distance measurements are vital to piece together the shape of the
Milky Way
Parallax measurements cover a very tiny portion of the
Milky Way, and thus other methods are necessary
Main sequence fitting
Cepheid variables have their brightness fluctuate in a way that is
related to their luminosity
Discussing Today
Basic galaxy formation
Other examples of galaxies
Common galaxy classifications
Evaluating classification models
How to Build a Galaxy
Heavy Metal
Stars in the MW halo are old!
A smaller fraction of heavy elements than the Sun
Largely low-mass, red stars
Stars in the disk are relatively young
A greater or equal fraction of heavy elements to the Sun
Lots of high and low mass stars, both blue and red
Stars in the halo must have formed early in the
Milky Way’s history
When fewer heavy elements existed
There is little to no ISM (gas) still in the halo to form stars
from
Galaxy Formation
Any theory of galactic formation needs to predict these differences
between halo and disk stars
Current theory is that of a giant protogalactic cloud that
collapses under gravity
Halo stars form as it collapses
Then get left behind as angular momentum flattens the collapsing
cloud
Problems with Protogalactic Clouds
Stars and star clusters would be forming the entire way throughout
the cloud’s collapse
So halo stars far from the center would be older (on average) than
halo stars nearer the center
Would imply that far away halo stars should have less heavy
elements
But in truth, all halo stars have about the same
elemental composition
May suggest a collision between multiple protogalactic clouds?
Galaxy Collisions
Galaxies tend to cluster in groups, so collisions are a very real
possibility
Evidence that the Milky Way has already consumed two galaxies in the
past
The MW will collide with the Andromeda galaxy in about 5 billion
years
Nearby Galaxies
Our neighbors: The Magellanic Clouds
The Magellanic Clouds
Large and Small (about 160,000 and 200,000 lyrs away)
Irregular dwarf galaxies, though they do have some spiral
structure
Both orbit the Milky Way (or do they?!)
Only visible in the Southern hemisphere
Our neighbors: Andromeda Galaxy
Andromeda Galaxy Location
Our neighbors: Andromeda Galaxy
Andromeda central disk
Our neighbors: Andromeda Galaxy
Andromeda in IR
If we could see the fainter
stars
Types of Galaxies
Galactic Flavors
Looking beyond our neighbors, all galaxies tend to come in one of
three main types:
Spiral
Elliptical
Irregular
Spiral Galaxies
Many of the same characteristics of the Milky Way
Spiral disk, bulge, halo, etc.
Can come in normal or “barred” varieties
Spiral arms can be wrapped to varying degrees
Spiral Arms
Blue regions indicate star forming regions
Galaxy rotates at same speed, so inner regions have shorter
periods
If arms moved with the stars, they would get all wound up!
Spiral density waves:
Pinches everything together in that region
Doesn’t effect normal stars much
Help molecular clouds collapse to start star formation
Elliptical Galaxies
Differ from spirals in important ways:
Have no disk
Rotate more slowly
Contain very little gas or dust
Contain mainly old stars
Huge range of sizes:
0.0001-100 times the MW size
Irregular Galaxies
The misfits that don’t match one of the other categories
Often times harbor very active star forming regions
Sometimes the result of galaxy collisions
Hubble Fork
Hubble’s Classification Fork
Judging Classifications
Fitting vs Classification
We’ve seen several ways to fit models to data already
Basic linear fitting
Non-linear model fitting
Both give a prediction of a continuous variable given some
inputs
Classification is about predicting a discrete variable (or
factor)
Techniques over Theory
In this class, I’m going to focus on techniques over the underlying
mathematical theory
Problem-solving is often a game of abstraction, and using techniques
as tools can help with that
You don’t need to know the details of how a least-squares fit is
done to make use of it
For rigorous work, you should be aware of at least the
basic theory underlying a technique, at least well enough to know if you
are misusing it
I am going to present the machine learning techniques in this unit
in a similar, technique over theory fashion
We have other classes if you want a deep dive into this sort of
content!
Be Positive!
With regression fitting, we commonly have an idea of a
residual, which measures how far from an actual value our
prediction came
A similar idea won’t hold for classification, because we either
correctly classified the point, or we didn’t
Instead, for a binary classification (A or B), predictions would
fall into 1 of 4 different bins:
True positive: An observation that should have been in category A,
which our model predicted was in category A
False positive: An observation that should have been in category B,
but which our model predicted would be in category A
True negative: An observation that should have been in category B,
which our model predicted was in category B
False negative: An observation that should have been in category A,
but which our model predicted would be in category B
Confusion Matrix
For either binary classification or multinomial classification, a
confusion matrix is often the best method to summarize model
prediction results visually
Compares actual categories across one axis to
predicted categories across the other
Each bin contains a count of how many observations with that actual
value were predicted
Multinomial confusion matrix
Making Comparisons
Comparing just confusion matrices can be ambiguous
Which model best classified the data of the below options?
Precision and Recall
For a binary classification, there are clear methods of evaluating a
model
Precision is a measure of how much you can trust the model
if it claims a positive \[ \text{Precision }
= \frac{TP}{TP + FP} \]
Recall is a measure of how reliably the model finds all the
positive observations \[ \text{Recall } =
\frac{TP}{TP + FN} \]
Accuracy
One of the simplest extensions of this to multinomial data is to use
accuracy
Accuracy is a probability that, for a random observation, the
predicted class is correct \[ \text{Accuracy
} = \frac{\text{Diagonal counts}}{\text{Total observations}}
\]