Scott Haines

San Francisco Bay Area
3K followers 500+ connections

About

Professional / Personal goals:
1. Teach More
2. Mentor More
3. Learn…

Experience

  • Nike
  • -

    United States

  • -

    United States

  • -

    Mountain View, California, United States

  • -

    Mountain View, California, United States

  • -

    Mountain View, California

  • -

    San Francisco, California

  • -

    Sunnyvale, California

Volunteer Experience

  • Teacher (PHP class)

    Hacker Dojo

    - 1 year 7 months

    Science and Technology

    The Hacker Dojo was looking for volunteers to teach programming to Bay Area professionals, and I wanted an outlet to share what I had learned in the field and help people get excited about code.

    Object-Oriented PHP 5
    From January to April of 2010, I taught Object-Oriented PHP 5 to a group of 10 students. We met once a week: I lectured for 2 hours, then we spent an hour on in-class assignments and questions. Homework was assigned as an outside lab to help ingrain the ideas presented in class.

    By the end of the 6 weeks, the students had learned the MVC (Model-View-Controller) design pattern, built a web framework using it, and created a dynamic PHP website that included authentication (SHA-1 hashing, MySQL) and PHP sessions.

    (class google group: http://groups.google.com/group/hacker-dojo-php/topics)

  • Teacher Ruby (Ruby with Rails 3.0)

    Hacker Dojo

    - 4 years 9 months

    Science and Technology

    Starting in November 2010, I led my second class at the Hacker Dojo. The objective of this class was to teach Ruby novices the Ruby programming language and then the Rails 3 framework, since without a firm grasp of the underlying language a framework holds you back.

    We started with two weekend Ruby language primer days (5 hours a session) covering the basics of the Ruby language. For the primer I wrote a short book called A Ruby Primer (https://github.com/newfront/hacker-dojo-rails3-code-base/tree/master/a_ruby_primer). We followed with 4 classes on using Rails 3.0 to build simple websites that became more difficult each week, and finished with a faux Twitter example.

    Meetup Group: http://www.meetup.com/HandsOnProgrammingEvents/events/15172524/
    Google Group: http://groups.google.com/group/dojo-ruby-on-rails/topics

  • Speaker

    Databricks

    - Present 6 years 11 months

    Science and Technology

    Presented at Spark Summit West 2018; I also speak at meetups and give talks year-round.

  • Speaker

    Open Data Science Conference (ODSC)

    - Present 6 years 6 months

    Science and Technology

    Workshop on Spark Streaming and Spark SQL at the ODSC West conference

Publications

  • Delta Lake : The Definitive Guide

    O’Reilly Media

    Discover how Delta Lake simplifies the process of building data lakehouses and data pipelines at scale. With this practical guide, data engineers, data scientists, and data analysts will explore key data reliability challenges and learn to apply modern data engineering and management techniques. You'll also understand how ACID transactions bring reliability to data lakehouses at scale.

    Authors Denny Lee, Prashanth Babu, Tristen Wentling, and Scott Haines explain how to harness the power of Delta Lake to increase your data productivity at scale. You'll learn how to run batch and streaming jobs concurrently on your data lake and accelerate the usability of your data by building effective and high-quality end-to-end pipelines, from data ingestion to analytics.

    This book helps you:

    Understand key data reliability challenges
    Examine data management and engineering techniques using the modern data stack
    Realize data reliability improvements using Delta Lake
    Concurrently run streaming and batch jobs against your data lake
    Execute update, delete, and merge commands
    Use time travel to roll back and examine previous versions of your data
    Build a streaming data quality pipeline following the medallion construct

    About the authors:

    Denny Lee is a Delta Lake maintainer and Apache Spark and MLflow contributor.

    Prashanth Babu is a Delta practitioner who works at Databricks.

    Tristen Wentling is a Delta practitioner who works at Databricks.

    Scott Haines is an Apache Spark and Delta Lake contributor who works at Nike.

  • Modern Data Engineering with Apache Spark: A Hands-On Guide For Building Mission-Critical Streaming Applications

    Apress

    Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.

    Apache Spark applications solve a wide range of data problems, from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload.

    This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker, and Kubernetes.

    Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. You will create and deploy mission-critical streaming Spark applications in a low-stress environment that paves the way for your own path to production.

  • 97 Things Every Data Engineer Should Know

    O’Reilly Media

    Pages 117-118: Pipe Dreams
    Pages 119-120: Preventing the Data Lake Abyss

    This book is a collection of thoughts and words of wisdom from many talented engineers and thought leaders in the modern data space.

  • The Rise of Operational Analytics

    O’Reilly Media

    Analyst report on the new field of NewSQL and the rise of continuous analytics systems for near real-time decision making.

  • Evolutionary Communication

    CIOReview

    A short history of conversational systems, virtual assistants and where the conversational platforms will take us in the very near future.
