Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Key Word in Context
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
{{Short description|Common format for concordance lines}} '''Key Word In Context''' ('''KWIC''') is the most common format for [[concordance (publishing)|concordance]] lines. The term KWIC was coined by [[Hans Peter Luhn]].<ref>{{cite book |last1=Manning |first1=C. D. |last2=Schütze |first2=H. |title=Foundations of Statistical Natural Language Processing |page=35 |publisher=The MIT Press |year=1999}}</ref> The system was based on a concept called ''keyword in titles'', which was first proposed for Manchester libraries in 1864 by [[Andrea Crestadoro]].<ref name="index">{{cite web|url=https://books.google.com/books?id=nIUkl7bLzYUC&dq=Andrea+Crestadoro&pg=PA41|title=Advanced Indexing and Abstracting Practices|publisher=Atlantic Publishers & Distributors|year=1989|page=41|access-date=26 March 2019|via=Google Books}}</ref> A KWIC index is formed by sorting and aligning the words within an article title to allow each word (except the [[stop words]]) in titles to be searchable alphabetically in the index.<ref>{{Cite web |url=http://www.janda.org/workshop/content%20analysis/kwic.htm |title=KWIC indexes and concordances |access-date=2016-06-17 |archive-url=https://web.archive.org/web/20160606141149/http://www.janda.org/workshop/content%20analysis/kwic.htm |archive-date=2016-06-06 |url-status=dead }}</ref> It was a useful indexing method for technical manuals before computerized [[full text search]] became common. For example, a search query including all of the words in an example definition ("KWIC is an acronym for Key Word In Context, the most common format for concordance lines") and the Wikipedia slogan in English ("the free encyclopedia"), searched against a Wikipedia page, might yield a KWIC index as follows. A KWIC index usually uses a wide layout to allow the display of maximum 'in context' information (not shown in the following example). {| nowrap |- |align=right|KWIC is an |'''acronym''' for Key Word In Context, ... |page 1 |- |align=right|... Key Word In Context, the most |'''common''' format for concordance lines. |page 1 |- |align=right|... the most common format for |'''concordance''' lines. |page 1 |- |align=right|... is an acronym for Key Word In |'''Context''', the most common format ... |page 1 |- |align=right|Wikipedia, The Free |'''Encyclopedia''' |page 0 |- |align=right|... In Context, the most common |'''format''' for concordance lines. |page 1 |- |align=right|Wikipedia, The |'''Free''' Encyclopedia |page 0 |- |align=right|KWIC is an acronym for |'''Key''' Word In Context, the most ... |page 1 |- | |'''KWIC''' is an acronym for Key Word ... |page 1 |- |align=right|... common format for concordance |'''lines'''. |page 1 |- |align=right|... for Key Word In Context, the |'''most''' common format for concordance ... |page 1 |- | |'''Wikipedia''', The Free Encyclopedia |page 0 |- |align=right|KWIC is an acronym for Key |'''Word''' In Context, the most common ... |page 1 |} A KWIC index is a special case of a ''permuted index''.<ref>{{cite web|url=http://infohost.nmt.edu/tcc/help/lang/python/examples/kwic/web/theory.html|title=3. Theory of KWIC indexing|website=Infohost.nmt.edu|access-date=26 March 2019|archive-date=14 May 2019|archive-url=https://web.archive.org/web/20190514012850/http://infohost.nmt.edu/tcc/help/lang/python/examples/kwic/web/theory.html|url-status=dead}}</ref> This term refers to the fact that it indexes all [[cyclic permutation]]s of the headings. Books composed of many short sections with their own descriptive headings, most notably collections of [[Manual page (Unix)|manual pages]], often ended with a '''permuted index''' section, allowing the reader to easily find a section by any word from its heading. This practice, also known as '''Key Word Out of Context''' ('''KWOC'''), is no longer common. <gallery> KWAC.png|Keyword alongside context (KWAC) KWIC.png|Keyword in context (KWIC) KWOC.png|Keyword out of context (KWOC) </gallery> ==References in literature== ''Note: The first reference does not show the KWIC index unless you pay to view the paper. The second reference does not even list the paper at all.'' * [[David Parnas|David L. Parnas]] uses a KWIC index as an example on how to perform modular design in his paper [http://portal.acm.org/citation.cfm?id=361623&coll=ACM&dl=ACM&CFID=9516243&CFTOKEN=98251202 ''On the Criteria To Be Used in Decomposing Systems into Modules''], available as an [https://web.archive.org/web/20070820133530/http://www.acm.org/classics/may96/ ACM Classic Paper] * Christopher D. Manning and Hinrich Schütze describe a KWIC index and computer concordancing in section 1.4.5 of their book ''Foundations of Statistical Natural Language Processing''. Cambridge, Mass: MIT Press, 1999. {{ISBN|9780262133609}}. They cite an article from H.P. Luhn from 1960, "Key word-in-context index for technical literature (kwic index)". * According to Rev. Gerard O'Connor's ''[http://www.clerus.org/clerus/dati/2004-09/13-13/CIMR1.htm Concordantia et Indices Missalium Romanorum]'', "Most of the concordances produced in recent times and with the aid of computer software use both the KWIC (keyword in context) and KWICn (keyword in center) formats, which lists the keyword, usually highlighted in bold text in a consistent position, within a limited amount of context text, i.e. three [or] four words of the text prior to the keyword and the same amount of text following. This format is extremely useful in that the keyword is easily identified together with its context. ... The Concordance of the Roman Missal is produced in both the KWIC and KWICn formats and is noteworthy in that each word form is listed as it appears in the text, that is, it is [[Lemma (morphology)|un-lemmatized]]." ==See also== * <code>[[Ptx (Unix)|ptx]]</code>, a Unix command-line utility producing a permuted index *[[Concordancer]] *[[Burrows–Wheeler transform]] *[[Suffix tree]] ==References== {{Reflist}} [[Category:Index (publishing)]] [[Category:Reference]] [[Category:Concordances (publishing)]]
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)
Pages transcluded onto the current version of this page
(
help
)
:
Template:Cite book
(
edit
)
Template:Cite web
(
edit
)
Template:ISBN
(
edit
)
Template:Reflist
(
edit
)
Template:Short description
(
edit
)