Data Day

April 9, 2026

Day 2 of PyCon Lithuania 2026TAŠKIUS, Vilnius

Explore the world of data with Python. From data engineering and pipelines to analytics, visualization, and machine learning — learn how to work with data at scale and derive insights that matter.

Keynote Speakers

Keynote

Katharine Jarmul

Founder, Kjamistan

Katharine Jarmul focuses her work and research on privacy and security in data science, deep learning and AI. She is author of the well received book Practical Data Privacy (O'Reilly 2023) and has more than 10 years experience in machine learning/AI where she has helped build large scale AI systems with privacy and security built in. You can follow her work via her newsletter, Probably Private (https://probablyprivate.com) or on her website at kjamistan.com.

Learn more

Keynote

Tomas Peluritis

Head of Data, Mediatech

Tomas Peluritis is Head of Data at Mediatech and founder of Uncle Data, a newsletter and podcast for data engineers who like their insights practical. He's built data stacks from the ground up or adjusted as needed, led teams across multiple companies and countries, and speaks at conferences sharing lessons from experience — the kind that don't make it into the docs. He believes in simple over smart and writes about what he's actually seen work (and fail) in production. He lives in Vilnius, Lithuania with his family, plays Magic: The Gathering poorly, but tries to improve.

Learn more

What to Expect

Talks and workshops happening on Data Day

Full Schedule

11:00

4 parallel sessions

Workshop

Inovation hall (4th building)

120 min

Exposing Greenwashing: Satellite ML for Carbon Credit Verification

The carbon market is set to reach 1T dollars by 2030, yet 84% of offsets fail to deliver real climate benefits. Verification still relies on sparse site visits and self-reported data. This poster shows a Python workflow that audits carbon projects using satellite imagery and ML, detecting over-crediting and leakage in REDD+ sites. With open data and open-source tools, anyone can compare claimed versus observed forest outcomes and verify what projects actually deliver.

Neeraj Pandey

Neeraj is the co-founder of Vivid Climate, a climate management and DMRV platform. Neeraj is a polyglot. Over the years, he has worked on a variety of full-stack software and data-science applications, as well as computational arts, and likes the challenge of creating new tools and applications, and is an active international speaker with talks and tutorials presented at multiple conferences.

April 9, 2026

Keynote Speakers

Katharine Jarmul

Tomas Peluritis

What to Expect

Exposing Greenwashing: Satellite ML for Carbon Credit Verification

Leading Through the Shift: What Engineering Leadership Actually Looks Like

Python for Data Quality in 2025: Why tests alone are no longer enough

Beyond the Static 2D Plot - Spatial Data Storytelling in 4D

Stats Meets ML - What I learned from my Machine Learning Certification

Airflow Lessons They Don't Put in the Docs

Python, rust and arrow for data processing

Creative Data Storytelling with Python

Designing Python APIs for Data You Don’t Control

Quantum Machine Learning with Qiskit

Dataset Updates Without Losing Your Mind

Making African Languages Visible: A Python-Based Guide to Low-Resource Language

Data versioning

Master the Art of Schema Dissection: Operation Data Engineer

Beyond SHAP: Diagnosing Vector Embeddings with Visual Explainable AI

From Sports Stats to AI Safety: The Ranking Renaissance

Behind Every Instant Loan Is Data Science: How Python Scorecards Decide Credit Risk

Cloud Data Solutions Are Overrated: Building a Pan-European Business Database for Lunch Money

Get Tickets for Data Day

Explore Other Days

April 9, 2026

Keynote Speakers

Katharine Jarmul

Tomas Peluritis

What to Expect

Exposing Greenwashing: Satellite ML for Carbon Credit Verification

Leading Through the Shift: What Engineering Leadership Actually Looks Like

Python for Data Quality in 2025: Why tests alone are no longer enough

Beyond the Static 2D Plot - Spatial Data Storytelling in 4D

Stats Meets ML - What I learned from my Machine Learning Certification

Airflow Lessons They Don't Put in the Docs

Python, rust and arrow for data processing

Creative Data Storytelling with Python

Designing Python APIs for Data You Don’t Control

Quantum Machine Learning with Qiskit

Dataset Updates Without Losing Your Mind

Making African Languages Visible: A Python-Based Guide to Low-Resource Language

Data versioning

Master the Art of Schema Dissection: Operation Data Engineer

Beyond SHAP: Diagnosing Vector Embeddings with Visual Explainable AI

From Sports Stats to AI Safety: The Ranking Renaissance

Behind Every Instant Loan Is Data Science: How Python Scorecards Decide Credit Risk

Cloud Data Solutions Are Overrated: Building a Pan-European Business Database for Lunch Money

Get Tickets for Data Day

Explore Other Days