A dataflow-based approach to the design and distribution of medical image analytics

Abstract

Machine Learning and imaging analytics are major algorithmic com-ponents of the software used by medical practitioners in the diagnosis and treatment of diseases. Whether employed by CADx (Computer Aided Diagnosis) or CBIR (Content Based Image Retrieval) tools, the accuracy and relevance of the results to the practitioner are paramount to the success of any such appli-cation. In order to improve on the existing results researchers often find themselves in the need to explore various approaches and methodologies, often using very large datasets and multiple sources of information. Each of these trials can, by itself, be a very time-consuming operation. One tried and true strategy to speed up operations is the use of a distributed computing platform (delivering the computational load to a number of machines). This raises a set of problems which are often orthogonal to a researcher’s interest such as which algorithmic implementations scale or how to distribute data and tasks on the grid. In this article we present a framework that empowers researchers to quickly design sets of tests, schedule their execution and have them automatically dis-tributed on a grid environment. We describe the design and implementation of the solution, and present as an example an experiment concerning the classification of mammography segmentations.

Authors

  • Frederico Valente
  • Augusto Silva
  • Carlos Costa
  • César Suárez Ortega
  • José Miguel Franco Valiente
  • Miguel Ángel Guevara López

More info

8th IBERGRID Infrastructure Conference, At Aveiro (Portugal), Volume: 1

Check the ResearchGate page