Posts

Does the Food Industry Drag Us Down in Protecting the Earth?

Food
US

This blog explores greenhouse gas emissions from food production and related activities, specifically the problems of excessive food packaging and refrigerant gases.

Questioning Quirky Quercus: Using k-means clustering to identify species of oak trees in Portland

oaks
hybridization
portland
k-means clusters

Hybridization, the crossing of separate species, has concerned biologists for hundreds of years. Oak hybridization in particular is a field in which considerable research effort has gone into distinguishing parental species from their hybrids. In the city of Portland, city-planted and naturally occurring *Quercus garryana* trees have exhibited physical traits that are more in line with the growth form attributed to *Quercus robur*. To determine the true identity of these trees, I used leaf morphology and *k*-means clustering to compare leaves from Portland park and natural-area trees with leaves from the home ranges of *Q. garryana* and *Q. robur*.

Decoding Coders

Coding
Logit
Regression
Kaggle
Programmers

We investigate how demographics influence programmers' coding language use, and explore what might determine whether you are #teamR or #teamPython.

Waste Washing Up On Shores: A Look at Plastic Pollution in Earth’s Oceans

world
environment

This blog post explores plastic pollution around the globe.

Sentiment aRt

Art
Books

Using sentiment analysis to make reproducible data art.

'This Book Is A Flop': An Analysis of Negative Amazon Book Reviews

topic modeling
Amazon reviews
books
negative reviews
text mining

We used topic modeling to look for patterns in the content of negative reviews.

3D Data Visualization with {rgl} and {ggrgl}

science
global

Exploring aquatic arctic community patterns with {rgl} and {ggrgl}.

Under The Roof: A general view of Airbnbs in the US

AirBnb
Portland

In the 12 years since its inception in 2008, Airbnb has taken over most of the traditional hospitality industry and become the top accommodation choice for travellers. It claims to offer a more unique and personal form of accommodation. This blog seeks to give a general view of the geographical distribution and pricing of Airbnb accommodations, with a particular focus on Portland, Oregon.

Wide ReceiveR: Using PCA to group NFL Receivers Based Off Combine Measurements

Plotly
PCA
NFL

How can we apply dimensionality reduction algorithms to NFL combine data in order to give us insight into the different types and archetypes of wide receivers drafted to the NFL?

Board Game Geek Top 10: The Past 15 years

Lifestyle
Culture

Explore the top 10 games in the Board Game Geek database over time.

Biodiversity in National Parks

Science
US

In this post we compare treemap, fmsb, and igraph packages to the old favorite ggplot2. Along the way we examine the question: "How does species diversity differ between National Parks?"

Finding Holes in Data Using the TDA R Package

Education
Tech

This blog post serves as an overview of how to use the TDA package in R to determine if a point cloud has any holes that aren't a result of noise in the data.

Mapping Litter

US
Environment

Using open-source litter cataloging data, we compared the seasonal trends of two coastal cities.

Better than `ggplot2`?

highchart
dataviz

Exploring the `highcharter` package.

Comparing Apples to Oranges

Food
World

We set out to compare data on apples to data on oranges, but will those data be as incomparable as the idiom would lead us to believe?

Political Corruption in Former Spanish Colonies

Politics
World
corrplot

While political corruption looks different in every country, it is also very present in some but not others. What are the effects of political corruption on other important political and economic factors of a country? To control for differences in history and culture, we look at former Spanish colonies that gained independence in the early 1800s.

Covid-19's Newest Challenger: Turtles and biodiversity data

Science
World
rgbif

The COVID-19 pandemic has disrupted a lot of global processes, but what about the collection of global biodiversity data, specifically on turtles, a group of animals in which most species are endangered? Using data from the Global Biodiversity Information Facility (GBIF) and their API wrapper package {rgbif}, we investigated how the pandemic has affected global biodiversity data collection.

Popular and Profitable Projects on Indiegogo

tech
culture

What's popular? What's profitable? Are they the same, or are they depressingly, disturbingly different? If you have rapidly waning dreams of being a successful entrepreneur, read this article and let us help you plan your next move!

Composition of Artworks in Tate Museum

Art
Culture
ggmap
leaflet
gganimate
janitor

How the Tate Museum progressed from 1950 to 2013.

New York City Uber Rides

US
Lifestyle
Transportation

Using data from May 2015 as a case study, we analyzed Uber usage in New York City.

The Evolution of Modern Music

Music
Culture

Looking at Spotify data for a few influential artists, how has music changed over the past decades?

Getting Green by Betting Green

Science
Tech
fmpapi
topPolluters

Can you make money by betting against the biggest polluters in the stock market? Let's see what the data says...

Do Country Health Indicators Affect Olympic Performance?

Sports
World

An exploration of whether the body mass index (BMI) and caloric intake statistics for different countries have predictive power for their performance in the Olympics.

Package Deal on Suite of R Data Packages

R Package

The students created several R data packages that contain fascinating data on cats, pollution, library check-outs, and so much more!

The US Supreme Court: Unanimity and Unions

US
Politics

What kinds of Supreme Court decisions are unanimous? How does this change when the cases being decided are union cases?

Exploring Words in PoKi Poems!

Education
Poetry

This post explores language data from poems written by students from first to twelfth grade.

Sadboi Hours: Are we listening to sadder music now than a decade ago?

Culture
Music
spotifyr
rvest

A Data Science project using data scraped from the Billboard Yearly Top Charts and Spotify track data.

A New Spectre of Gentrification in Portland

Politics
Culture

Portland's southeast is experiencing a new wave of gentrification. We seek to understand where hotspots of gentrification might be occurring, and then to find what factors are contributing to current population movements and displacements.

Recovering from Covid-19

covid-19
humanorgans

I do not need to emphasize how much the COVID-19 pandemic has affected the world and the people in it. The pandemic has forced lockdowns, states of emergency, and economic distress like never before. While many of the effects of the virus are known, there are many things left to be discovered. For example, what are the long-term effects of the virus on those who survive it? In this blog post, I will look at the effect of recovery from COVID-19.

Welcome to Math 241 Blog

This is the Reed College Math 241: Data Science Blog where the students explore and learn from data!

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".