View text source at Wikipedia


Orange (software)

Orange
Developer(s)University of Ljubljana
Initial release10 October 1996; 28 years ago (1996-10-10)[1]
Stable release
3.38.1[2] / 23 December 2024; 16 days ago (23 December 2024)
RepositoryOrange Repository
Written inPython, Cython, C++, C
Operating systemCross-platform
TypeMachine learning, Data mining, Data visualization, Data analysis
LicenseGPLv3 or later[3][4]
Websiteorangedatamining.com Edit this at Wikidata
A typical workflow in Orange .
A typical workflow in Orange 3.

Orange is an open-source data visualization, machine learning and data mining toolkit. It features a visual programming front-end for exploratory qualitative data analysis and interactive data visualization.

Classification Tree widget in Orange 3.

Description

[edit]

Orange is a component-based visual programming software package for data visualization, machine learning, data mining, and data analysis.

Orange components are called widgets. They range from simple data visualization, subset selection, and preprocessing to empirical evaluation of learning algorithms and predictive modeling.

Visual programming is implemented through an interface in which workflows are created by linking predefined or user-designed widgets, while advanced users can use Orange as a Python library for data manipulation and widget alteration.[5]

Software

[edit]

Orange is an open-source software package released under GPL and hosted on GitHub. Versions up to 3.0 include core components in C++ with wrappers in Python. From version 3.0 onwards, Orange uses common Python open-source libraries for scientific computing, such as numpy, scipy and scikit-learn, while its graphical user interface operates within the cross-platform Qt framework.

The default installation includes a number of machine learning, preprocessing and data visualization algorithms in 6 widget sets (data, transform, visualize, model, evaluate and unsupervised). Additional functionalities are available as add-ons (text-mining, image analytics, bioinformatics, etc.).

Orange is supported on macOS, Windows and Linux and can also be installed from the Python Package Index repository (pip install Orange3).

Features

[edit]

Orange consists of a canvas interface onto which the user places widgets and creates a data analysis workflow. Widgets offer basic functionalities such as reading the data, showing a data table, selecting features, training predictors, comparing learning algorithms, visualizing data elements, etc. The user can interactively explore visualizations or feed the selected subset into other widgets.

A decorated dendrogram in Orange 3
A decorated dendrogram in Orange 3.


Add-ons

[edit]

Orange users can extend their core set of components with components in the add-ons. Supported add-ons include:

Objectives

[edit]

The program provides a platform for experiment selection, recommendation systems, and predictive modelling and is used in biomedicine, bioinformatics, genomic research, and teaching. In science, it is used as a platform for testing new machine learning algorithms and for implementing new techniques in genetics and bioinformatics. In education, it was used for teaching machine learning and data mining methods to students of biology, biomedicine, and informatics.

Extensions

[edit]

Various projects build on Orange either by extending the core components with add-ons or using only the Orange Canvas to exploit the implemented visual programming features and GUI.

History

[edit]

In 1996, the University of Ljubljana and Jožef Stefan Institute started development of ML*, a machine learning framework in C++, and Python bindings were developed for this framework in 1997, which, together with emerging Python modules, formed a joint framework called Orange. Over the following years, most contemporary major algorithms for data mining and machine learning were implemented in C++ (Orange's core) or Python modules.

References

[edit]
  1. ^ "orange3/CHANGELOG.md at master . biolab/orange3 . GitHub". GitHub.
  2. ^ "Release 3.38.1". 23 December 2024. Retrieved 26 December 2024.
  3. ^ "Orange - License".
  4. ^ "orange3/LICENSE at master . biolab/orange3 . GitHub". GitHub.
  5. ^ Janez Demšar; Tomaž Curk; Aleš Erjavec; Črt Gorup; Tomaž Hočevar; Mitar Milutinovič; Martin Možina; Matija Polajnar; Marko Toplak; Anže Starič; Miha Stajdohar; Lan Umek; Lan Žagar; Jure Žbontar; Marinka Žitnik; Blaž Zupan (2013). "Orange: data mining toolbox in Python" (PDF). Journal of Machine Learning Research. 14 (1): 2349–2353.
  6. ^ Toplak, M.; Birarda, G.; Read, S.; Sandt, C.; Rosendahl, S. M.; Vaccari, L.; Demšar, J.; Borondics, F. (2017). "Infrared Orange: Connecting Hyperspectral Data with Machine Learning". Synchrotron Radiation News. 30 (4): 40–45. Bibcode:2017SRNew..30...40T. doi:10.1080/08940886.2017.1338424. S2CID 125273654.
  7. ^ Iomids (30 May 2024). "Checking AI for discrimination via GUI using the Orange Fairness Add-On". IOMIDS.
  8. ^ Sanchez Del Rio, Manuel; Rebuffi, Luca (2017). "OASYS (Or Ange SYnchrotron Suite): An open-source graphical environment for x-ray virtual experiments". In Chubar, Oleg; Sawhney, Kawal (eds.). Advances in Computational Methods for X-Ray Optics IV. p. 28. doi:10.1117/12.2274263. ISBN 9781510612334. S2CID 117118973.
  9. ^ Primož Godec; Matjaž Pančur; Nejc Ilenič; Andrej Čopar; Martin Stražar; Aleš Erjavec; Ajda Pretnar; Janez Demšar; Marko Toplak; Anže Starič; Lan Žagar; Jan Hartman; Hamilton Wang; Riccardo Bellazzi; Uroš Petrovič; Silvia Garagna; Maurizio Zuccotti; Dongsu Park; Gad Shaulsky; Blaž Zupan (2019). "Democratized image analytics by visual programming through integration of deep models and small-scale machine learning". Nature Communications. 10 (1): 4551. Bibcode:2019NatCo..10.4551G. doi:10.1038/s41467-019-12397-x. PMC 6779910. PMID 31591416. S2CID 203782491.
  10. ^ Marko Toplak; Stuart T. Read; Christophe Sandt; Ferenc Borondics (2021). "Quasar: Easy Machine Learning for Biospectroscopy". Cells. 10 (9): 2300. doi:10.3390/cells10092300. PMC 8466383. PMID 34571947.
  11. ^ "Orange3-Geo Documentation — Orange3-Geo documentation".
  12. ^ Martin Stražar; Lan Žagar; Jaka Kokošar; Vesna Tanko; Aleš Erjavec; Pavlin G. Poličar; Anže Starič; Janez Demšar; Gad Shaulsky; Vilas Menon; Andrew Lemire; Anup Parikh; Blaž Zupan (2021). "scOrange—a tool for hands-on training of concepts from single-cell data analytics". Bioinformatics. 35 (14): i4 – i12. doi:10.1093/bioinformatics/btz348. PMC 6612816. PMID 31510695.
  13. ^ "Orange Canvas Core". GitHub.

Further reading

[edit]
[edit]