Releases: lincc-frameworks/hyrax
v0.7.0
What's Changed
- Adding "Open in Colab" button for getting started page by @drewoldag in #590
- Removed return type hint by @drewoldag in #591
- Adding Doc Pages For Config System and Data Flow by @maxwest-uw in #594
- Adding links to the most useful of the old reference docs by @mtauraso in #595
- Expand description of UMAP by @gitosaurus in #592
- Adding Core concepts page by @drewoldag in #596
- Fixing typos on the core concepts page by @drewoldag in #598
- Migrate documentation from sphinx_tabs to sphinx-design by @Copilot in #600
- Rename model_inputs to data_request with deprecation handling by @Copilot in #606
- Move NaN handling from training loop to DataProvider.collate() by @Copilot in #615
- Fix _get_available_bands_from_manifest to find complete band entries by @aritraghsh09 in #629
- Removing Python 3.10 support by @mtauraso in #638
- Data Caching for tensors by @mtauraso in #593
- Remove python 3.10 compatibility pin by @mtauraso in #641
- Adding example notebook showing how configs work by @drewoldag in #631
- Clean up engine.py docstring and remove sporadic tilde characters from comments by @Copilot in #632
- Add test verb to evaluate models and save outputs for analysis by @Copilot in #630
- Rename to_tensor to prepare_inputs with backward compatibility and reproducibility fixes by @Copilot in #607
- Implement safer pattern for data key access in HyraxAutoencoderV2 by @Copilot in #650
- Bringing Unsupervised Models Back in Line With Main by @aritraghsh09 in #647
- Use Pydantic to wrap dictionary requests to Hyrax by @gitosaurus in #622
- Fix bug in Pydantic validation by @gitosaurus in #661
- Introducing verb-specific batch functions in models to fix the `torch.no_grad` issue with the `test` verb by @drewoldag in #664
- Tune Pydantic validation to user purpose by @gitosaurus in #665
- Bump dawidd6/action-download-artifact from 12 to 13 by @dependabot[bot] in #660
- Setting preload_cache default to `false` by @drewoldag in #670
- Use HSC1k dataset for DataCache ASV benchmark by @Copilot in #674
- Refactor: Extract duplicated code from test and infer verbs into shared utilities by @Copilot in #646
- Only call load_to_tensor if load_prepare_inputs returns None by @Copilot in #682
- Remove legacy [data_set] configuration fallback by @Copilot in #678
- LR Scheduler implemented by @SamSandwich07 in #652
- Fixing MLFlow Tracking Directory by @aritraghsh09 in #693
- Design for integrating LanceDB into Hyrax by @gitosaurus in #683
- Clearer mlflow directory by @mtauraso in #695
- Implement Lance columnar storage for inference results by @Copilot in #699
- Use the scheduler on the inner model, not the wrapped model by @mtauraso in #706
- 'data' key not required; can be any friendly name. by @gitosaurus in #710
- `visualize` verb should find all its inputs from config. by @gitosaurus in #713
- Lr scheduler notebook by @SamSandwich07 in #702
- Get pre-executed notebooks running again, with needed Pydantic and Lance changes by @gitosaurus in #711
- Fixing up model wrapping and tracking of metrics code, learning scheduler by @mtauraso in #707
- Working with `ResultsDataset` common workflow notebook by @drewoldag in #717
- Moving precache threads to config by @mtauraso in #673
- Updated `data_request` to support splits by @drewoldag in #690
- Mtauraso robot instructions by @mtauraso in #679
- [AUTO-MERGE] Rename `REQUIRED_SPLITS`/`OPTIONAL_SPLITS` to `REQUIRED_DATA_GROUPS`/`OPTIONAL_DATA_GROUPS` by @Copilot in #723
- Removing the requirement for Datasets to implement ids() by @mtauraso in #704
- Remove deprecated `data_set.name` config key and references by @Copilot in #729
- Bump actions/upload-artifact from 6 to 7 by @dependabot[bot] in #740
- Bump dawidd6/action-download-artifact from 13 to 16 by @dependabot[bot] in #739
- [AUTO-MERGE] Removed `Dataset` from ResultDataset class inheritance by @drewoldag in #735
- [AUTO-MERGE] Testing to see if we can remove the numba pinning. by @drewoldag in #745
- [AUTO_MERGE] Remove if/else from `dist_data_loader` for collate_func by @Copilot in #727
- [AUTO-MERGE] Handle None values in _inner_loop tensor conversion by @Copilot in #689
- [AUTO-MERGE] Removed unused input argument in `create_results_writer` by @drewoldag in #733
- [AUTO-MERGE] Second round of removing iterable dataset references and support. by @drewoldag in #738
- [AUTO-MERGE] Adding two notebooks about working with results to Common Workflows. by @drewoldag in #748
- Removed nan-handlers for torch tensors. Updated the unit tests. by @drewoldag in #751
- [AUTO-MERGE] Add list_models() and list_dataset_classes() to Hyrax by @Copilot in #747
- [AUTO-MERGE] Adding lance-to-parquet conversion notebook. by @drewoldag in #754
- [AUTO-MERGE] Enable training from pre-trained model weights (fine-tuning support) by @Copilot in #752
- Checkpointing Notebook by @SamSandwich07 in #749
- [AUTO-MERGE] Pass a deepcopy of config to verbs and Hyrax methods by @drewoldag in #757
- [AUTO-MERGE] Polished existing vector database notebook, added to common workflows by @drewoldag in #753
- [AUTO-MERGE] Fix typo in documentation regarding infrastructure by @drewoldag in #769
- [AUTO-MERGE] Remove Python 3.10 from supported versions in getting started docs by @Copilot in #766
- [AUTO-MERGE] science example notebook - unsupervised image-based by @drewoldag in #759
- [AUTO-MERGE] Enhance README with project details and installation guide by @drewoldag in #768
- Load pretrained weights notebook and bug by @drewoldag in #770
- Adding excalidraw diagrams. by @drewoldag in #772
- fix example notebook by @ztq1996 in #776
- [AUTO-MERGE] Fix load_pretrained_model notebook to not execute during doc build by @drewoldag in #775
- Cleaned up HyraxCifarDataset by @drewoldag in #774
- Remove unused `pynvml` dependency to eliminate PyTorch deprecation warning by @Copilot in #779
- [AUTO-MERGE] Add list of external libraries that we know will be called and do not have default configs files. by @drewoldag in #785
- Hyperparameter tuning example notebook by @drewoldag in #777
- [AUTO-MERGE] Updating the MLFlow backend from file to sqlite-based. by @drewoldag in #781
- Adding LanceDB pin to fix tests with NaNs inserted into ResultDatasets by @mtauraso in #799
- Update comment to reflect bug by @mtauraso in https://github.com/lincc-frameworks/h...
v0.6.10
What's Changed
- Updating docstrings so that sphinx will build the documentation. by @drewoldag in #581
- Bump dawidd6/action-download-artifact from 11 to 12 by @dependabot[bot] in #580
- Bump actions/upload-artifact from 5 to 6 by @dependabot[bot] in #578
- Bump actions/cache from 4 to 5 by @dependabot[bot] in #577
- Bump astral-sh/setup-uv from 5 to 7 by @dependabot[bot] in #579
- Adding a first draft of the landing page, by @drewoldag in #584
- LSST dataset improvements by @mtauraso in #543
- Some tidying up of introductory documentation by @gitosaurus in #587
- Getting started documentation page + CIFAR/CNN updates by @drewoldag in #586
New Contributors
- @gitosaurus made their first contribution in #587
Full Changelog: v0.6.9...v0.6.10
v0.6.9
What's Changed
- Use simple model.onnx file name and save a copy of config by @drewoldag in #574
Full Changelog: v0.6.8...v0.6.9
v0.6.8
What's Changed
- Model updates rebased to main by @aritraghsh09 in #535
- Implementation of `to_onnx` verb. by @drewoldag in #527
- Add semi-hidden feature to allow recording metrics at the end of each training epoch by @drewoldag in #540
- Potential fix for ONNX-ify problem by @drewoldag in #541
- Bump actions/checkout from 5 to 6 by @dependabot[bot] in #542
- Fix GPU-trained models failing to load on CPU-only machines by @Copilot in #486
- Fixes a bug in the way that we load a checkpoint file to resume a training run. by @drewoldag in #544
- Initial implementation of a CSV dataset class by @drewoldag in #477
- Updating to the latest copier version. by @drewoldag in #545
- Load/save to_tensor during inference by @drewoldag in #546
- Initial scaffolding for `engine` verb by @drewoldag in #550
- seperate `onnx` output from `pytorch.ignite` by @maxwest-uw in #558
- Default collate function built in DataProvider by @drewoldag in #557
- Adding InferenceDatasetWriter to engine to persist results. by @drewoldag in #559
- Use the new, non-torch-based, collate function in `engine` by @drewoldag in #560
- Find and use custom collate functions defined on dataset classes by @drewoldag in #561
- Refactor ONNX convert to work for models with different input types by @drewoldag in #564
- Fixing issues encountered while trying to to-onnx applecider models. by @drewoldag in #565
Full Changelog: v0.6.7...v0.6.8
v0.6.7
What's Changed
- Add PyPi badge by @jeremykubica in #526
- Bump peter-evans/find-comment from 3 to 4 by @dependabot[bot] in #522
- Bump actions/upload-artifact from 4 to 5 by @dependabot[bot] in #523
- Bump peter-evans/create-or-update-comment from 4 to 5 by @dependabot[bot] in #524
- Update pytorch-ignite requirement from <=0.5.2 to <=0.5.3 by @dependabot[bot] in #525
- Update automatic criterion and optimizer machinery for transparency by @drewoldag in #529
- Explicitly use `fork` in InferenceDatasetWriter by @drewoldag in #533
New Contributors
- @jeremykubica made their first contribution in #526
Full Changelog: v0.6.6...v0.6.7
v0.6.6
What's Changed
- Adding `onnxscript` dependency. by @drewoldag in #490
- Attach final metrics to model returned from `h.train()`. by @drewoldag in #485
- Allow users to provide an externally defined collate function to PyTorch's DataLoader by @drewoldag in #492
- Reduce verbosity of model logging by @drewoldag in #494
- Enabling distinct splits to be defined for training by @drewoldag in #487
- Add helpful error for empty model_inputs and rename DATA_SET_REGISTRY to DATASET_REGISTRY by @Copilot in #514
- Move model parameters to model-specific config subtables by @Copilot in #516
- Set model.name to false in default config and add informative error by @Copilot in #513
- Remove automatic ONNX export from training by @Copilot in #520
Full Changelog: v0.6.5...v0.6.6
v0.6.5
What's Changed
- Updating built-in models that are not already using `data_sample` by @drewoldag in #474
- Fix `set_config` method to handle dotted quoted strings in TOML table names by @Copilot in #484
Full Changelog: v0.6.4...v0.6.5
v0.6.4
What's Changed
- Pulling `colorama` in as a dependency to highlight the model and data being used for training and inference. by @drewoldag in #478
- Allow user defined metadata fields in HyraxRandomDataset. by @drewoldag in #482
- Fix NaN handling for tuples with length > 2 by @Copilot in #481
- Remove unused ConfigDict class and references by @Copilot in #475
Full Changelog: v0.6.3...v0.6.4
v0.6.3
Biggest changes
tl;dr;
Users should now use h.set_config('path.to.config.key', <value>) instead of h.config['path']['to']['config']['key'] = <value> when modifying the configuration in a notebook.
the deets
We've fixed a bug in the configuration system that prevented default configuration files from being loaded properly when updating the configuration in a notebook. We've introduced the new set_config method to increase transparency of the action, instead of trying to expand the automagical functionality that existed before.
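To make the idea of a dotted-key setter concrete, here is a minimal, self-contained sketch. It is not Hyrax's implementation: `set_by_dotted_key` is a hypothetical helper written for illustration, the `train.epochs` key is made up, and the real `set_config` may behave differently (for example around quoted table names or default config loading).

```python
from typing import Any


def set_by_dotted_key(config: dict, dotted_key: str, value: Any) -> None:
    """Set config["a"]["b"]["c"] = value given the string "a.b.c".

    Hypothetical helper for illustration only; not part of Hyrax.
    """
    *parents, leaf = dotted_key.split(".")
    node = config
    for part in parents:
        # Fail loudly on an unknown table instead of silently creating it,
        # so a typo in the key path is caught at the point of the update.
        if part not in node or not isinstance(node[part], dict):
            raise KeyError(f"No config table named {part!r} in {dotted_key!r}")
        node = node[part]
    node[leaf] = value


# Made-up configuration used only for this example.
config = {"model": {"name": False}, "train": {"epochs": 10}}
set_by_dotted_key(config, "train.epochs", 20)
```

Routing updates through a single function call like this gives the library one explicit place to validate the key path and react to the change, which chained dictionary assignment does not.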
What's Changed
- Lowering many logger.INFOs to logger.DEBUGs. by @drewoldag in #445
- Update model_weights_file config when auto-detected and persist in runtime config by @Copilot in #442
- Fix KeyError when data_location is not defined for iterable datasets and add registry validation by @Copilot in #444
- Updating example notebooks for `model_inputs`. by @drewoldag in #437
- Fix config file validation to fail for non-existent config files by @Copilot in #452
- Fix vector db benchmarks data access after HyraxQL release by @Copilot in #455
- Fix Parser help when config file is specified early on cli. Present error on nonexistant config. by @mtauraso in #453
- Updated model_inputs 1 and 2 based on PR feedback. by @drewoldag in #461
- Bump asv from 0.6.4 to 0.6.5 by @dependabot[bot] in #464
- Bump actions/setup-python from 5 to 6 by @dependabot[bot] in #465
- Use of `fetch_dataset_class` instead of accessing DATA_SET_REGISTRY by @drewoldag in #462
- Working to make ConfigManager easier to use by @drewoldag in #468
- Fix vector DB benchmarks to use model_inputs configuration by @Copilot in #470
Full Changelog: v0.6.2...v0.6.3
v0.6.2
What's Changed
- Add comprehensive GitHub Copilot instructions for Hyrax development by @Copilot in #426
- Fix KeyError when primary_id_field is not included in fields list by @Copilot in #436
- Fix DataProvider.resolve_data performance regression by @drewoldag in #440
Full Changelog: v0.6.1...v0.6.2