Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Data mining
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==Software== {{Category see also|Data mining and machine learning software}} ===Free open-source data mining software and applications=== The following applications are available under free/open-source licenses. Public access to application source code is also available. * [[Carrot2]]: Text and search results clustering framework. * [[Chemicalize.org]]: A chemical structure miner and web search engine. * [[ELKI]]: A university research project with advanced [[cluster analysis]] and [[outlier detection]] methods written in the [[Java (programming language)|Java]] language. * [[General Architecture for Text Engineering|GATE]]: a [[natural language processing]] and language engineering tool. * [[KNIME]]: The Konstanz Information Miner, a user-friendly and comprehensive data analytics framework. * [[MOA (Massive Online Analysis)|Massive Online Analysis (MOA)]]: a real-time big data stream mining with concept drift tool in the [[Java (programming language)|Java]] programming language. * [[Multi expression programming|MEPX]]: cross-platform tool for regression and classification problems based on a Genetic Programming variant. * [[mlpack]]: a collection of ready-to-use machine learning algorithms written in the [[C++]] language. * [[NLTK]] ([[Natural Language Toolkit]]): A suite of libraries and programs for symbolic and statistical natural language processing (NLP) for the [[Python (programming language)|Python]] language. * [[OpenNN]]: Open [[Artificial neural network|neural networks]] library. * [[Orange (software)|Orange]]: A component-based data mining and [[machine learning]] software suite written in the [[Python (programming language)|Python]] language. *[[PSPP]]: Data mining and statistics software under the GNU Project similar to [[SPSS]] * [[R (programming language)|R]]: A [[programming language]] and software environment for [[statistical]] computing, data mining, and graphics. It is part of the [[GNU Project]]. * [[scikit-learn]]: An open-source machine learning library for the Python programming language; * [[Torch (machine learning)|Torch]]: An [[open-source]] [[deep learning]] library for the [[Lua (programming language)|Lua]] programming language and [[scientific computing]] framework with wide support for [[machine learning]] algorithms. * [[UIMA]]: The UIMA (Unstructured Information Management Architecture) is a component framework for analyzing unstructured content such as text, audio and video β originally developed by IBM. * [[Weka (machine learning)|Weka]]: A suite of machine learning software applications written in the [[Java (programming language)|Java]] programming language. ===Proprietary data-mining software and applications=== The following applications are available under proprietary licenses. * [[Angoss]] KnowledgeSTUDIO: data mining tool * [[LIONsolver]]: an integrated software application for data mining, business intelligence, and modeling that implements the Learning and Intelligent OptimizatioN (LION) approach. * [[PolyAnalyst]]: data and text mining software by Megaputer Intelligence. * [[Microsoft Analysis Services]]: data mining software provided by [[Microsoft]]. * [[NetOwl]]: suite of multilingual text and entity analytics products that enable data mining. * [[Oracle Data Mining]]: data mining software by [[Oracle Corporation]]. * [[PSeven]]: platform for automation of engineering simulation and analysis, multidisciplinary optimization and data mining provided by [[DATADVANCE]]. * [[Qlucore]] Omics Explorer: data mining software. * [[RapidMiner]]: An environment for [[machine learning]] and data mining experiments. <!-- Latest version is NOT opensource --> * [[SAS (software)#Components|SAS Enterprise Miner]]: data mining software provided by the [[SAS Institute]]. * [[SPSS Modeler]]: data mining software provided by [[IBM]]. * [[STATISTICA]] Data Miner: data mining software provided by [[StatSoft]]. * [[Tanagra (machine learning)|Tanagra]]: Visualisation-oriented data mining software, also for teaching. * [[Vertica]]: data mining software provided by [[Hewlett-Packard]]. * [[Google Cloud Platform]]: automated custom ML models managed by [[Google]]. * [[Amazon SageMaker]]: managed service provided by [[Amazon.com|Amazon]] for creating & productionising custom ML models.
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)