NFL Play By Play Analysis

Blog Summary: (AI Summaries by Summarizes)
  • Advanced NFL Stats has released the play by play data of the 2002 season.
  • The author performed a quick analysis of the data using Hive and MapReduce to look at incomplete passes.
  • The code used for the analysis is available on the author's GitHub account.
  • The author created two graphs showing the most incomplete passes from a QB to a receiver and the same data averaged out over the number of seasons they played together and ordered by the highest average.
  • The author updated the analysis to include the 2010 data.

Advanced NFL Stats just released the play by play of the 2002 season on.

I some quick analysis of the data using Hive and MapReduce and decided to look at incomplete passes. The code is here on my GitHub account.

 

incompletes

Most incomplete passes from a QB to a receiver.

incompletesorderedbyaverage

Most incomplete passes from a QB to a receiver averaged out over the number of seasons they played together and ordered by the highest average.

incompleteswithseason

Update: Added in 2010 data.

Related Posts

zoomed in line graph photo

Data Teams Survey 2023 Follow-Up

Blog Summary: (AI Summaries by Summarizes)Many companies, regardless of size, are using data mesh as a methodology.Smaller companies may not necessarily need a data mesh

Laptop on a table showing a graph of data

Data Teams Survey 2023 Results

Blog Summary: (AI Summaries by Summarizes)A survey was conducted between January 24, 2023, and February 28, 2023, to gather data for the book “Data Teams”

Black and white photo of three corporate people discussing with a view of the city's buildings

Analysis of Confluent Buying Immerok

Blog Summary: (AI Summaries by Summarizes)Confluent has announced the acquisition of Immerok, which represents a significant shift in strategy for Confluent.The future of primarily ksqlDB

Tall modern buildings with the view of the ocean's horizon

Brief History of Data Engineering

Blog Summary: (AI Summaries by Summarizes)Google created MapReduce and GFS in 2004 for scalable systems.Apache Hadoop was created in 2005 by Doug Cutting based on

Big Data Institute horizontal logo

Independent Anniversary

Blog Summary: (AI Summaries by Summarizes)The author founded Big Data Institute eight years ago as an independent, big data consulting company.Independence allows for an unbiased