Welcome to BEE 4850/5850!


Lecture 01

January 21, 2026

Course Introduction

About Me

Instructor: Prof. Vivek Srikrishnan,

Interests:

  • Bridging Earth science, data science, and decision science to improve climate risk management;
  • Unintended consequences which result from neglecting uncertainty or system dynamics.

Meet My Supervisors

My Supervisors

TA

What Do You Hope To Get Out Of This Course?

Take a moment, write it down, and we’ll share!

What We Are Doing This Semester?

My Philosophical Position

  • Probability theory helps us deduce logical implications of theories conditional on our assumptions
  • Cannot use an “objective” procedure to avoid subjective responsibility

Bart Statistics Meme

Model-Based Data Analysis

We can (transparently):

  • Examine logical implications of model assumptions (including interventions/out-of-sample generation).
  • Assess evidence for multiple hypotheses by generating simulated data.
  • Identify opportunities to design future experiments or observations to distinguish between competing hypotheses.

Model-Based Data Analysis

Models are how we assess evidence for theories.

Rock Paper Scissors meme

Course Organization

timeline
      Introduction and Exploratory Analysis (Weeks 1-2): Overview
                  : Why Analyze Data?
                  : Exploratory Analysis
                  : Data Visualization
      Probability Fundamentals (Weeks 3-6): Prob/Stats "Review"
                                          : Modeling Data-Generating Processes
                                          : Maximum Likelihood
                                          : Time Series
                                          : Extreme Values
      Simulation Methods (Weeks 7-10): Random Simulation
                                    : Monte Carlo
                                    : The Bootstrap
                                    : Missing Data and Imputation
      Model Evaluation (Weeks 12-13): Bias and Variance
                      : Cross-Validation
                      : Hypothesis Testing
                      : Information Criteria
      Experimental Design (Weeks 14-15): Causal Models and Confounds
                                 : Value of Information

Course Policies

Background Knowledge: Computing

  • Basics (at the level of CS 111x)
  • No specific language requirement.
  • Some extra work/effort may be needed if you haven’t coded in a while.
  • May need some additional familiarity with statistical packages (and “light” optimization)

Background Knowledge: Probability/Statistics

  • ENGRD 2700/CEE 3040
  • Summary statistics of data
  • Probability distributions
  • Basic visualizations
  • Monte Carlo basics

Grades

Assessment Weight
Literature Critique/Participation 5%
Labs 10%
Readings 10%
Homework Assignments 20%
Prelims 30%
Term Project 25%

Overall Guidelines

  • Collaboration highly encouraged, but all work must reflect your own understanding
  • Submit PDFs on Gradescope
  • 50% penalty for late submission (up to 24 hours)
  • Standard rubric available on website
  • Always cite external references

Literature Critique

  • For 5850 students
  • Select a paper which involves some type of statistical or data analysis
  • Critique choices: do they support the scientific conclusions?
  • Submit a 2-3 page writeup at the end of the semester
  • If you’re unsure where to look for a paper, talk to Prof. Srikrishnan

Readings

  • Several readings assigned for discussion throughout the semester.
  • Annotation assignments on Canvas: by end of the week.
  • Submit 1 page summary by next Monday, including description of key points and how they related to the course.
  • The more personal the reflections, the better.

Labs

  • In-class worksheets
  • Intended to get hands on or conceptual practice
  • Group collaboration encouraged, can submit jointly.
  • Should be able to do most of the work in class, due the next week.

Homework Assignments

  • More in-depth problems, mostly computational
  • Roughly 2 weeks to complete
  • Regrade requests must be made within one week

Prelims

  • Two in-class (non-cumulative) prelims.
  • Will focus on conceptual/interpretative problems.

Term Project

  • Analyze a question of interest using a data set of your choosing
  • Can work individually or groups of 2
  • Several deliverables throughout the semester

Term Project Structure

  • Updates due as we progress through material.
  • Final report due during finals week.

Attendance

Not required, but students tend to do better when they’re actively engaged in class.

Office Hours

  • Instructor: MWTh 10-11 AM, 318 Riley-Robb
  • TA: TBD
  • Almost impossible to find a time that works for all (or even most); please feel free to make appointments as/if needed.

Accomodations

If you have any access barriers in this class, please seek out any helpful accomodations.

  • Get an SDS letter.
  • If you need an accomodation before you have an official letter, please reach out to me ASAP!

Academic Integrity

Hopefully not a concern…

  • Collaboration is great and is encouraged!
  • Knowing how to find and use helpful resources is a skill we want to develop.
  • Don’t just copy…learn from others and give credit.
  • Submit your own original work.

Academic Integrity

Obviously, just copying down answers from Chegg or ChatGPT and passing them off as your own is not ok.

LLMs: Bullshit Generators

Think about ChatGPT as a drunk who tells stories for drinks.

It will give you plausible-looking text or code on any topic, but it doesn’t know anything beyond what it “overheard.”

ChatGPT can be useful for certain tasks (e.g. understanding code errors), but may neglect context for why/when certain information or solutions work.

ChatGPT: The Stochastic Parrot

Must specifically call out where you used ChatGPT in your work (beyond simple referencing; see syllabus for details).

Class Tools

Communications

Use Ed Discussion for questions and discussions about class, homework assignments, etc.

  • Try to use public posts so others can benefit from questions and can weigh in.
  • I will make announcements through Ed.

Email

When urgency or privacy is required, email is ok.

Important

Please include BEE4850 in your email subject line! This will ensure it doesn’t get lost in the shuffle.

Better: Use Ed Discussion and reserve email for matters that are particular urgent and/or require privacy.

Course Website

https://envdata.viveks.me/spring-2026/

  • Central hub for information, schedule, and policies
  • Will add link and some information to Canvas (assignment due dates, etc)

Computing Tools

  • Course is programming language-agnostic.
  • Assignments will have notebooks set up for Julia (environments, etc) on GitHub.

Some Tips For Success

  • Start the homeworks early; this gives time to sort out conceptual problems and debug.
  • Ask questions (in class and online) and try to help each other.
  • Give me feedback!

Upcoming Schedule

Next Classes

  • Why Analyze Data?

Assessments

Homework 1 available; due Friday (2/9).

References

References (Scroll for Full List)