== Text analytics ==
{{see also|List of text mining methods}}
'''Text analytics''' describes a set of [[linguistics|linguistic]], [[statistical]], and [[machine learning]] techniques that model and structure the information content of textual sources for [[business intelligence]], [[exploratory data analysis]], [[research]], or investigation.<ref>[http://intelligent-enterprise.informationweek.com/blog/archives/2007/02/defining_text_a.html] {{webarchive|url=https://web.archive.org/web/20091129171151/http://intelligent-enterprise.informationweek.com/blog/archives/2007/02/defining_text_a.html|date=November 29, 2009}}</ref> The term is roughly synonymous with text mining; indeed, [[Ronen Feldman]] modified a 2000 description of "text mining"<ref>{{cite web|url=https://www.cs.cmu.edu/~dunja/CFPWshKDD2000.html |title=KDD-2000 Workshop on Text Mining – Call for Papers |publisher=Cs.cmu.edu |access-date=2015-02-23}}</ref> in 2004 to describe "text analytics".<ref>[http://www.ir.iit.edu/cikm2004/tutorials.html#T2] {{webarchive|url=https://web.archive.org/web/20120303042253/http://www.ir.iit.edu/cikm2004/tutorials.html#T2|date=March 3, 2012}}</ref> The latter term is now used more frequently in business settings, while "text mining" is used in some of the earliest application areas, dating to the 1980s,<ref>{{cite book |doi=10.3115/991813.991833 |title=Proceedings of the 9th conference on Computational linguistics |year=1982 |last1=Hobbs |first1=Jerry R. |last2=Walker |first2=Donald E. |last3=Amsler |first3=Robert A. |volume=1 |pages=127–32|chapter=Natural language access to structured text |s2cid=6433117 }}</ref> notably life-sciences research and government intelligence. The term "text analytics" also describes the application of these techniques to business problems, whether independently or in conjunction with query and analysis of fielded, numerical data.

It is often asserted that about 80% of business-relevant information originates in [[unstructured data|unstructured]] form, primarily text.<ref name="breakthroughanalysis1">{{cite web|url=http://breakthroughanalysis.com/2008/08/01/unstructured-data-and-the-80-percent-rule/ |title=Unstructured Data and the 80 Percent Rule |work=Breakthrough Analysis |access-date=2015-02-23|date=August 2008 }}</ref> Text analytics techniques and processes discover and present knowledge (facts, [[business rule]]s, and relationships) that is otherwise locked in textual form, impenetrable to automated processing.