Transforms implementation#
How the transforms library that exists on Foundry is stubbed in Foundry DevTools#
Input datasets < 500MB are retrieved with the Foundry Dataproxy File Download API
Input datasets > 500MB are retrieved as partial subsets with the Foundry Dataproxy SQL API (SELECT * FROM … LIMIT)
Both of these limits can be changed with the corresponding config option
transforms_sql_dataset_size_threshold
size in MB, default 500transforms_sql_sample_row_limit
, default 5000
Datasets are cached locally in a folder (Linux: ~/.cache/foundry-dev-tools, macOS: ~/Library/Caches/foundry-dev-tools, Windows: %USERPROFILE%\AppData\Local\foundry-dev-tools\Cache)
Datasets are automatically passed to transform functions
Output datasets can be visualized using IDE or Notebook functionality
Output dataset files can be stored to a local folder for inspection
Foundry
transforms
library is stubbedDataset branch is detected automatically from the local git branch
The following sequence diagrams show
a) what happens when the Input
dataset is already cached.
b) what happens when the Input
dataset has a new transaction