A library to load data from the Trackinsight API
Project description
trackinsight-data-python
Installation
uv init
uv add trackinsight-data-python
uv add dotenv # needed to load environment variables
uv add polars # needed to manipulate dataframes
Setup the environment variables
Some variables are read from the environment.
We recommend managing environment variables in a local .env file and loading it via the dotenv library
# .env
TRACK_API_KEY=YOUR_KEY_HERE # your API key
TRACK_API_DL_WORKERS=10 # number of parallel threads when downloading data
TRACK_API_STORAGE=trackinsight_data # local folder to use when storing data on disk
TRACK_API_VERIFY_CERT=True # Set to False if certificate validation is not possible in you environment
Example Usage
# main.py
import polars as pl
import trackinsight_data_python as API
from dotenv import load_dotenv
load_dotenv(override=True)
ccy='usd'
metadata= API.getMetadata()
stamp = max(metadata["reportsAsOf"][ccy]) # get latest stamp for reports
reports = API.getReports(ccy=ccy,stamp=stamp) # reports is a Polars DataFrame, you can use .to_pandas() to convert it to a Pandas DataFrame
nuclear_etfs = reports.filter(pl.col('class_theme').list.join(';') =='Nuclear Energy')
ids = nuclear_etfs["share_id"].to_list()
usd_timeseries = API.getTimeseries(ids=ids,start='2024-01-01',end=None,ccy=ccy)
holdings = API.getHoldings(ids=ids)
uv run python main.py
Public API
The functions below are exported by trackinsight_data_python and can be imported from the package root:
import trackinsight_data_python as API
DataFrame Loaders
Use these functions when you want to load API data directly into memory as Polars DataFrames.
metadata = API.getMetadata(asDataFrame=False)
Returns available report and holdings partition metadata. When asDataFrame=True, returns the metadata as a Polars DataFrame instead of a dictionary.
shares_df = API.getShares()
Loads the full shares dataset into a Polars DataFrame.
stock_flows_df = API.getStockFlows(format='parquet')
Loads the full stock flows dataset into a Polars DataFrame. format must be one of the supported formats and defaults to parquet.
timeseries_df = API.getTimeseries(start='2019-01-01', end=None, ccy='eur', ids=None)
Loads timeseries rows for ccy between start and end. ccy must be one of the supported currencies. start and end use YYYY-MM-DD strings; end=None leaves the upper bound open. Use ids to restrict the result to specific share IDs.
reports_df = API.getReports(stamp=None, ccy='eur', ids=None, periods=None)
Loads report rows for ccy. ccy must be one of the supported currencies. When stamp=None, the latest available report stamp for the currency is used. periods is a list or tuple of supported period names; when None, the default report periods are requested. Use ids to restrict the result to specific share IDs.
holdings_df = API.getHoldings(ids=None, proxy=True, level=0, extraLines=False)
Loads holdings rows. proxy controls whether proxy holdings are included. level controls ETF look-through expansion depth. extraLines controls whether special portfolio lines are included. Use ids to restrict the result to specific share IDs.
liquidity_df = API.getLiquidity(start, end, ccy='eur', ids=None)
Loads liquidity rows for ccy between start and end. ccy must be one of the supported currencies. Dates use YYYY-MM-DD strings. Use ids to restrict the result to specific share IDs.
liquidity_summary_df = API.getLiquiditySummary(start, end, ccy='eur', ids=None)
Loads liquidity summary rows for ccy between start and end. ccy must be one of the supported currencies. Dates use YYYY-MM-DD strings. Use ids to restrict the result to specific share IDs.
Downloaders
Use these functions when you want to download API data to local files and receive a glob pattern pointing to the downloaded dataset. Files are written in a Hive-partitioned folder layout, which is well suited for query engines such as DuckDB and PyArrow-based tools.
API.downloadShares(format='parquet')
Downloads the shares dataset to disk and returns a glob pattern for the downloaded files. format must be one of the supported formats and defaults to parquet.
API.downloadStockFlows(format='parquet')
Downloads the stock flows dataset to disk and returns a glob pattern for the downloaded files. format must be one of the supported formats and defaults to parquet.
API.downloadReports(stamp=None, ccy='eur', format='parquet', periods=None)
Downloads report rows for ccy and returns a glob pattern for the downloaded files. ccy must be one of the supported currencies. When stamp=None, the latest available report stamp for the currency is used. periods is a list or tuple of supported period names; when None, the default report periods are requested. format must be one of the supported formats and defaults to parquet.
API.downloadTimeseries(start, end, ccy='eur', format='parquet')
Downloads timeseries rows for ccy between start and end, then returns a glob pattern for the downloaded files. ccy must be one of the supported currencies. Dates use YYYY-MM-DD strings. format must be one of the supported formats and defaults to parquet.
API.downloadHoldings(format='parquet', proxy=True, level=0, extraLines=False)
Downloads holdings rows and returns a glob pattern for the downloaded files. proxy, level, and extraLines have the same behavior as getHoldings(). format must be one of the supported formats and defaults to parquet.
API.downloadLiquidity(start, end, ccy='eur', format='parquet')
Downloads liquidity rows for ccy between start and end, then returns a glob pattern for the downloaded files. ccy must be one of the supported currencies. Dates use YYYY-MM-DD strings. format must be one of the supported formats and defaults to parquet.
API.downloadLiquiditySummary(start, end, ccy='eur', format='parquet')
Downloads liquidity summary rows for ccy between start and end, then returns a glob pattern for the downloaded files. ccy must be one of the supported currencies. Dates use YYYY-MM-DD strings. format must be one of the supported formats and defaults to parquet.
Supported Values
ccy:eur,usdformat:parquet,json,csvperiods:one-day,one-week,week-to-date,one-month,month-to-date,three-month,three-month-to-date,six-month,six-month-to-date,one-year,year-to-date,one-year-to-date,three-year,three-year-to-date
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file trackinsight_data_python-0.1.8.tar.gz.
File metadata
- Download URL: trackinsight_data_python-0.1.8.tar.gz
- Upload date:
- Size: 11.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6fe2c01cd9318844fa26a03282c45256f39ac6d399f51533102c1539088fbe4f
|
|
| MD5 |
fbed1d7a3d2d4f82f230d51186e7b291
|
|
| BLAKE2b-256 |
065e991dca95e129d8cea5c0a6e60963c1e4584971b374ec40600ef91f430030
|
File details
Details for the file trackinsight_data_python-0.1.8-py3-none-any.whl.
File metadata
- Download URL: trackinsight_data_python-0.1.8-py3-none-any.whl
- Upload date:
- Size: 11.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bb15783c9c2bc53e1fda4084a29fd3b2251e444affa063e23adec7c816df278e
|
|
| MD5 |
7c9e2b7c7bc5ddd998e6d55d138f13d3
|
|
| BLAKE2b-256 |
073587498157c5bfb4960d8213060cd63ba13194177dd7f89a62bc799782b10c
|