GitPedia

Tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

From databrickslabs·Updated May 11, 2026·View on GitHub·

Welcome to Tempo: timeseries manipulation for Spark. This project builds upon the capabilities of [PySpark](https://spark.apache.org/docs/latest/api/python/index.html) to provide a suite of abstractions and functions that make operations on timeseries data easier and highly scalable. The project is written primarily in Jupyter Notebook, distributed under the Other license, first published in 2020. Key topics include: data-analysis, data-science, pandas, python, scala.

Latest release: v0.1.29
November 18, 2024View Changelog →

tempo - Time Series Utilities for Data Teams Using Databricks

<p align="center"> <img src="https://raw.githubusercontent.com/databrickslabs/tempo/master/tempo%20-%20light%20background.svg" width="300px"/> </p>

Project Description

Welcome to Tempo: timeseries manipulation for Spark.
This project builds upon the capabilities of PySpark to provide
a suite of abstractions and functions that make operations on timeseries data easier and highly scalable.

NOTE that the Scala version of Tempo is now deprecated and no longer in development.

image
codecov
Downloads
PyPI version
docs
lines of code

Tempo Project Documentation

Contributors

Showing top 12 contributors by commit count.

View all contributors on GitHub →

This article is auto-generated from databrickslabs/tempo via the GitHub API.Last fetched: 6/27/2026