Releases: lincc-frameworks/hyrax

v0.7.0

01 Apr 19:09
e95bc49

What's Changed

  • Adding "Open in Colab" button for getting started page by @drewoldag in #590
  • Removed return type hint by @drewoldag in #591
  • Adding Doc Pages For Config System and Data Flow by @maxwest-uw in #594
  • Adding links to the most useful of the old reference docs by @mtauraso in #595
  • Expand description of UMAP by @gitosaurus in #592
  • Adding Core concepts page by @drewoldag in #596
  • Fixing typos on the core concepts page by @drewoldag in #598
  • Migrate documentation from sphinx_tabs to sphinx-design by @Copilot in #600
  • Rename model_inputs to data_request with deprecation handling by @Copilot in #606
  • Move NaN handling from training loop to DataProvider.collate() by @Copilot in #615
  • Fix _get_available_bands_from_manifest to find complete band entries by @aritraghsh09 in #629
  • Removing Python 3.10 support by @mtauraso in #638
  • Data Caching for tensors by @mtauraso in #593
  • Remove python 3.10 compatibility pin by @mtauraso in #641
  • Adding example notebook showing how configs work by @drewoldag in #631
  • Clean up engine.py docstring and remove sporadic tilde characters from comments by @Copilot in #632
  • Add test verb to evaluate models and save outputs for analysis by @Copilot in #630
  • Rename to_tensor to prepare_inputs with backward compatibility and reproducibility fixes by @Copilot in #607
  • Implement safer pattern for data key access in HyraxAutoencoderV2 by @Copilot in #650
  • Bringing Unsupervised Models Back in Line With Main by @aritraghsh09 in #647
  • Use Pydantic to wrap dictionary requests to Hyrax by @gitosaurus in #622
  • Fix bug in Pydantic validation by @gitosaurus in #661
  • Introducing verb-specific batch functions in models to fix the torch.no_grad issue with the test verb by @drewoldag in #664
  • Tune Pydantic validation to user purpose by @gitosaurus in #665
  • Bump dawidd6/action-download-artifact from 12 to 13 by @dependabot[bot] in #660
  • Setting preload_cache default to false by @drewoldag in #670
  • Use HSC1k dataset for DataCache ASV benchmark by @Copilot in #674
  • Refactor: Extract duplicated code from test and infer verbs into shared utilities by @Copilot in #646
  • Only call load_to_tensor if load_prepare_inputs returns None by @Copilot in #682
  • Remove legacy [data_set] configuration fallback by @Copilot in #678
  • LR Scheduler implemented by @SamSandwich07 in #652
  • Fixing MLFlow Tracking Directory by @aritraghsh09 in #693
  • Design for integrating LanceDB into Hyrax by @gitosaurus in #683
  • Clearer mlflow directory by @mtauraso in #695
  • Implement Lance columnar storage for inference results by @Copilot in #699
  • Use the scheduler on the inner model, not the wrapped model by @mtauraso in #706
  • 'data' key not required; can be any friendly name. by @gitosaurus in #710
  • visualize verb should find all its inputs from config. by @gitosaurus in #713
  • Lr scheduler notebook by @SamSandwich07 in #702
  • Get pre-executed notebooks running again, with needed Pydantic and Lance changes by @gitosaurus in #711
  • Fixing up model wrapping and tracking of metrics code, learning scheduler by @mtauraso in #707
  • Working with ResultsDataset common workflow notebook by @drewoldag in #717
  • Moving precache threads to config by @mtauraso in #673
  • Updated data_request to support splits by @drewoldag in #690
  • Mtauraso robot instructions by @mtauraso in #679
  • [AUTO-MERGE] Rename REQUIRED_SPLITS/OPTIONAL_SPLITS to REQUIRED_DATA_GROUPS/OPTIONAL_DATA_GROUPS by @Copilot in #723
  • Removing the requirement for Datasets to implement ids() by @mtauraso in #704
  • Remove deprecated data_set.name config key and references by @Copilot in #729
  • Bump actions/upload-artifact from 6 to 7 by @dependabot[bot] in #740
  • Bump dawidd6/action-download-artifact from 13 to 16 by @dependabot[bot] in #739
  • [AUTO-MERGE] Removed Dataset from ResultDataset class inheritance by @drewoldag in #735
  • [AUTO-MERGE] Testing to see if we can remove the numba pinning. by @drewoldag in #745
  • [AUTO_MERGE] Remove if/else from dist_data_loader for collate_func by @Copilot in #727
  • [AUTO-MERGE] Handle None values in _inner_loop tensor conversion by @Copilot in #689
  • [AUTO-MERGE] Removed unused input argument in create_results_writer by @drewoldag in #733
  • [AUTO-MERGE] Second round of removing iterable dataset references and support. by @drewoldag in #738
  • [AUTO-MERGE] Adding two notebooks about working with results to Common Workflows. by @drewoldag in #748
  • Removed nan-handlers for torch tensors. Updated the unit tests. by @drewoldag in #751
  • [AUTO-MERGE] Add list_models() and list_dataset_classes() to Hyrax by @Copilot in #747
  • [AUTO-MERGE] Adding lance-to-parquet conversion notebook. by @drewoldag in #754
  • [AUTO-MERGE] Enable training from pre-trained model weights (fine-tuning support) by @Copilot in #752
  • Checkpointing Notebook by @SamSandwich07 in #749
  • [AUTO-MERGE] Pass a deepcopy of config to verbs and Hyrax methods by @drewoldag in #757
  • [AUTO-MERGE] Polished existing vector database notebook, added to common workflows by @drewoldag in #753
  • [AUTO-MERGE] Fix typo in documentation regarding infrastructure by @drewoldag in #769
  • [AUTO-MERGE] Remove Python 3.10 from supported versions in getting started docs by @Copilot in #766
  • [AUTO-MERGE] science example notebook - unsupervised image-based by @drewoldag in #759
  • [AUTO-MERGE] Enhance README with project details and installation guide by @drewoldag in #768
  • Load pretrained weights notebook and bug by @drewoldag in #770
  • Adding excalidraw diagrams. by @drewoldag in #772
  • fix example notebook by @ztq1996 in #776
  • [AUTO-MERGE] Fix load_pretrained_model notebook to not execute during doc build by @drewoldag in #775
  • Cleaned up HyraxCifarDataset by @drewoldag in #774
  • Remove unused pynvml dependency to eliminate PyTorch deprecation warning by @Copilot in #779
  • [AUTO-MERGE] Add list of external libraries that we know will be called and do not have default configs files. by @drewoldag in #785
  • Hyperparameter tuning example notebook by @drewoldag in #777
  • [AUTO-MERGE] Updating the MLFlow backend from file to sqlite-based. by @drewoldag in #781
  • Adding LanceDB pin to fix tests with NaNs inserted into ResultDatasets by @mtauraso in #799
  • Update comment to reflect bug by @mtauraso in https://github.com/lincc-frameworks/h...

v0.6.10

15 Jan 00:51
5b8e5ca

What's Changed

New Contributors

Full Changelog: v0.6.9...v0.6.10

v0.6.9

18 Dec 19:25
bda3e28

What's Changed

  • Use simple model.onnx file name and save a copy of config by @drewoldag in #574

Full Changelog: v0.6.8...v0.6.9

v0.6.8

18 Dec 17:56
ce90e25

What's Changed

  • Model updates rebased to main by @aritraghsh09 in #535
  • Implementation of to_onnx verb. by @drewoldag in #527
  • Add semi-hidden feature to allow recording metrics at the end of each training epoch by @drewoldag in #540
  • Potential fix for ONNX-ify problem by @drewoldag in #541
  • Bump actions/checkout from 5 to 6 by @dependabot[bot] in #542
  • Fix GPU-trained models failing to load on CPU-only machines by @Copilot in #486
  • Fixes a bug in the way that we load a checkpoint file to resume a training run. by @drewoldag in #544
  • Initial implementation of a CSV dataset class by @drewoldag in #477
  • Updating to the latest copier version. by @drewoldag in #545
  • Load/save to_tensor during inference by @drewoldag in #546
  • Initial scaffolding for engine verb by @drewoldag in #550
  • separate onnx output from pytorch.ignite by @maxwest-uw in #558
  • Default collate function built in DataProvider by @drewoldag in #557
  • Adding InferenceDatasetWriter to engine to persist results. by @drewoldag in #559
  • Use the new, non-torch-based, collate function in engine by @drewoldag in #560
  • Find and use custom collate functions defined on dataset classes by @drewoldag in #561
  • Refactor ONNX convert to work for models with different input types by @drewoldag in #564
  • Fixing issues encountered while trying to to-onnx applecider models. by @drewoldag in #565

Full Changelog: v0.6.7...v0.6.8

v0.6.7

12 Nov 22:08
00cc1c2

What's Changed

New Contributors

Full Changelog: v0.6.6...v0.6.7

v0.6.6

31 Oct 21:55
180ecc3

What's Changed

  • Adding onnxscript dependency. by @drewoldag in #490
  • Attach final metrics to model returned from h.train(). by @drewoldag in #485
  • Allow users to provide an externally defined collate function to PyTorch's DataLoader by @drewoldag in #492
  • Reduce verbosity of model logging by @drewoldag in #494
  • Enabling distinct splits to be defined for training by @drewoldag in #487
  • Add helpful error for empty model_inputs and rename DATA_SET_REGISTRY to DATASET_REGISTRY by @Copilot in #514
  • Move model parameters to model-specific config subtables by @Copilot in #516
  • Set model.name to false in default config and add informative error by @Copilot in #513
  • Remove automatic ONNX export from training by @Copilot in #520

Full Changelog: v0.6.5...v0.6.6

v0.6.5

15 Oct 03:55
5a4a8c3

What's Changed

  • Updating built-in models that are not already using data_sample by @drewoldag in #474
  • Fix set_config method to handle dotted quoted strings in TOML table names by @Copilot in #484

Full Changelog: v0.6.4...v0.6.5

v0.6.4

10 Oct 03:32
371e961

What's Changed

  • Pulling colorama in as a dependency to highlight the model and data being used for training and inference. by @drewoldag in #478
  • Allow user defined metadata fields in HyraxRandomDataset. by @drewoldag in #482
  • Fix NaN handling for tuples with length > 2 by @Copilot in #481
  • Remove unused ConfigDict class and references by @Copilot in #475

Full Changelog: v0.6.3...v0.6.4

v0.6.3

08 Oct 16:21
6951aff

Biggest changes

tl;dr
Users should now use h.set_config('path.to.config.key', <value>) instead of h.config['path']['to']['config']['key'] = <value> when modifying the configuration in a notebook.

the deets
We've fixed a bug in the configuration system that prevented default configuration files from loading properly when the configuration was updated in a notebook. Rather than expanding the automagical behavior that existed before, we've introduced the new set_config method, which makes the update an explicit, visible action.
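
To illustrate the idea of a dotted-key setter, here is a minimal sketch on a plain nested dictionary. It is not Hyrax's implementation: the helper name set_by_dotted_key and the example keys (train.epochs, train.optimizer.lr) are hypothetical, and the real set_config may validate keys or reload defaults in ways not shown here.

```python
from typing import Any


def set_by_dotted_key(config: dict, dotted_key: str, value: Any) -> None:
    """Hypothetical helper: walk a nested dict along 'a.b.c' and set the leaf."""
    *parents, leaf = dotted_key.split(".")
    node = config
    for name in parents:
        # Assumption for this sketch: create missing intermediate tables on demand.
        node = node.setdefault(name, {})
    node[leaf] = value


# Hypothetical configuration, standing in for a loaded TOML config.
config = {"train": {"epochs": 10}}

# One explicit call replaces config["train"]["epochs"] = 20.
set_by_dotted_key(config, "train.epochs", 20)

# Intermediate tables that do not exist yet are created in this sketch.
set_by_dotted_key(config, "train.optimizer.lr", 0.001)
```

Routing every change through a single function gives the library one place to react to an update, which direct dictionary assignment does not offer.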

What's Changed

  • Lowering many logger.INFOs to logger.DEBUGs. by @drewoldag in #445
  • Update model_weights_file config when auto-detected and persist in runtime config by @Copilot in #442
  • Fix KeyError when data_location is not defined for iterable datasets and add registry validation by @Copilot in #444
  • Updating example notebooks for model_inputs. by @drewoldag in #437
  • Fix config file validation to fail for non-existent config files by @Copilot in #452
  • Fix vector db benchmarks data access after HyraxQL release by @Copilot in #455
  • Fix Parser help when config file is specified early on cli. Present error on nonexistent config. by @mtauraso in #453
  • Updated model_inputs 1 and 2 based on PR feedback. by @drewoldag in #461
  • Bump asv from 0.6.4 to 0.6.5 by @dependabot[bot] in #464
  • Bump actions/setup-python from 5 to 6 by @dependabot[bot] in #465
  • Use of fetch_dataset_class instead of accessing DATA_SET_REGISTRY by @drewoldag in #462
  • Working to make ConfigManager easier to use by @drewoldag in #468
  • Fix vector DB benchmarks to use model_inputs configuration by @Copilot in #470

Full Changelog: v0.6.2...v0.6.3

v0.6.2

24 Sep 16:53
30ef6c7

What's Changed

  • Add comprehensive GitHub Copilot instructions for Hyrax development by @Copilot in #426
  • Fix KeyError when primary_id_field is not included in fields list by @Copilot in #436
  • Fix DataProvider.resolve_data performance regression by @drewoldag in #440

Full Changelog: v0.6.1...v0.6.2