Code indexing in gitaly is broken and leads to code not being visible to the user. We work on the issue with highest priority.

Snippets Groups Projects

Please migrate repositories to https://gitea.psi.ch - if you need help, contact gitea@psi.ch

trained interlaces

Fischer Robert authored 2 weeks ago

d001fb38

d001fb38 2 weeks ago

Name	Last commit	Last update
.gitignore
LICENSE
README.md
V2_example_notebook.ipynb
V2_feature_stack.py
V2_training.py
env_working_segmentation.txt
env_working_segmentation_explicit.txt
example_notebook.ipynb
filter_functions.py
interactive.py
segmentation.py
training_functions.py

PyTrainSeg - implementation of Weka-like 4D machine learning segmentation

Description

This collection of codes extends the idea of trainable Weka segmentation (TWS, https://imagej.net/plugins/tws/) to 4D. The implementation is in Python using dask for lazy evaluation and jupyterLab for interactive training. As any ML method, the 4D ML segmentation is computationally very heavy. Consider if it is really necessary.

Usage

This is a project in development. Image feature creation, training and segmentation are called from a main jupyter notebook. It might work if you execute the example notebook from top to bottom. Depending on your hardware and python environment, this might not work at all or crash eventually, but it has gotten much more stable in the recent past ;) I am working on a 64 cores 1 TB RAM 3TB SSD work station for my 4D-CT data (roughly 1700x150x800x70 voxel). I would start testing with a much smaller ROI.

Main concepts

The 4D image data has to be loaded by dask itself from h5df. Storing the data as netcdf4 (a normed h5df) on a SSD adds the convenience of using xarray for rapid data viewing. Dask creates chunkes (pieces) of the data which then allow the lazy loading and evaluation. Calculating features for the full dataset at once is way larger than available RAM and can only be executed in pieces (chunks). As in TWS, a set of features is created for every pixel by applying a bunch of image filters. The new addition is the employment of 4D filters (e.g. 4D Gaussian Blur) and time specific filters (e.g. minimum grayvalue over all time steps). By manually labeling parts of the image, a random forest classifier is trained to assign every pixel to the label classes based on its feature set.

Contact

The state of the project is basically a breadboard with wires sticking out and external devices patched with duct tape. The are too many limitations and bugs to list here at the moment. However, it works good enough for my inteded usage :). Feel free to use it, but I would really appreciate to learn of modifications you make since they might be helpful for me, too. I see potential in a better technical implementation, as well as in the selection and employment of clever alternative image filters, esp. in the time domain. If you want to dig into it, maybe you want to talk to me first for demonstration.

Cheers,

Robert Fischer

robert.fischer@psi.ch

Reference

PyTrainSeg was published alongside this paper:

@article{Fischer2024,
   author = {Fischer, R. and Dessiex, M. and Marone, F. and Büchi, Felix N.},
   title = {Gas-induced structural damages in bipolar membrane forward bias CO2 electrolysis studied by fast X-ray tomography},
   journal = {ACS Applied Energy Materials},
   DOI = {10.1021/acsaem.3c02882.},
   year = {2024},
   type = {Journal Article}
}

If you like this repository and use it for your work, please consider a citation.