Ecoinformatics
{{More references|date=October 2015}}

'''Ecoinformatics''', or ecological informatics, is the [[Informatics (academic field)|science of information]] in [[ecology]] and [[environmental science]]. It integrates environmental and information sciences to define entities and natural processes with language common to both humans and computers. However, this is a rapidly developing area in ecology and there are alternative perspectives on what constitutes ecoinformatics.

Several definitions are in circulation, most centered on the creation of tools to access and analyze natural system data. The scope and aims of ecoinformatics are nonetheless broader than the development of [[metadata]] standards for documenting datasets. Ecoinformatics aims to facilitate environmental research and management by developing ways to access and integrate [[database]]s of environmental information, and by developing new algorithms that enable different environmental datasets to be combined to test ecological hypotheses. Ecoinformatics is related to the concept of [[ecosystem service]]s.<ref name="Villa Ceroni Bagstad Johnson 2009">{{cite web | last1=Villa | first1=Ferdinando | last2=Ceroni | first2=Marta | last3=Bagstad | first3=Ken | last4=Johnson | first4=Gary | last5=Krivov | first5=Sergey | title=ARIES (ARtificial Intelligence for Ecosystem Services): A new tool for ecosystem services assessment, planning, and valuation | website=ResearchGate | date=2009-01-01 | url=https://www.researchgate.net/publication/228342331 | access-date=2022-01-23}}</ref>

Ecoinformatics characterizes the [[semantics]] of natural system knowledge. For this reason, much of today's ecoinformatics research relates to the branch of [[computer science]] known as [[knowledge representation]], and active ecoinformatics projects are developing links to activities such as the [[Semantic Web]].

Current initiatives to manage, share, and reuse ecological data effectively indicate the increasing importance of fields like ecoinformatics in developing the foundations for managing ecological information. Examples of these initiatives are [[National Science Foundation]] [[Datanet]] projects, [[DataONE]], Data Conservancy, and [[Artificial Intelligence for Environment & Sustainability]].<ref name="Villa Ceroni Bagstad Johnson 2009"/>

== Software Development Lifecycle ==
Central to the concept of ecoinformatics is the [[Systems development life cycle|Software Development Lifecycle]] (SDLC), a systematic framework for writing, implementing, and maintaining software products. In ecoinformatics projects, the development pipeline typically includes collecting data, usually from several different environmental data sources, integrating those sources, and then analyzing the data. Here, each step of the SDLC is described in the context of ecoinformatics, per Michener et al.<ref name=":0">{{Cite journal |last1=Michener |first1=William K. |last2=Jones |first2=Matthew B. |date=February 2012 |title=Ecoinformatics: supporting ecology as a data-intensive science |url=https://doi.org/10.1016/j.tree.2011.11.016 |journal=Trends in Ecology & Evolution |volume=27 |issue=2 |pages=85–93 |doi=10.1016/j.tree.2011.11.016 |pmid=22240191 |s2cid=12268743 |issn=0169-5347}}</ref> The plan, collect, assure, describe, and preserve steps refer to the data collection entity, which can be an individual researcher or a large data-collection network, while the discover, integrate, and analyze steps typically refer to the individual researcher.

'''Plan''': Ecoinformatics projects require data from several databases. Each database holds different data, so researchers should identify which types of environmental or ecological data they will need to answer their research question.

'''Collect''': Data is collected in several different ways. In ecoinformatics, this is usually restricted to manually entering data into a spreadsheet and parsing data from an existing database. The growth of [[Relational database|relational databases]] has made it easier for ecologists to download relevant data and integrate datasets.

'''Assure''': Data entries should be checked thoroughly to validate their accuracy and usability, for example by checking for outliers and erroneous points. The same principle applies to data downloaded from existing datasets. This responsibility falls on both the ecologist downloading the data and the entity that sets up the data collection system.

'''Describe''': The metadata of a dataset used in a study should include enough information to deduce the data collection and processing methodology, when the data were collected, why the data were collected, and how the data were stored. This is important for reproducibility, especially for projects that build on each other and may reuse data.

'''Preserve''': After data is collected by an institutional entity, it should be archived so that it is easily accessible. Ideally, this is in databases that are maintained and not at risk of deprecation.

'''Discover''': While there are good practices for discovering data to start a research project, this process is often hampered by a lack of usable, published data, as researchers may collect data specific to their study but not publish it for wider use. On the data collection end, this can be addressed by better data-sharing practices, such as linking datasets when publishing papers or studies. On the data procurement end, this can be addressed by more precise data searching, such as using keywords to find relevant datasets.

'''Integrate''': Synthesizing datasets can be difficult and labor-intensive, largely due to methodological differences in data collection.
There are several approaches to this, but the best practices typically involve computational approaches, namely using [[R (programming language)|R]] or [[Python (programming language)|Python]], to automate the processes and prevent errors.

'''Analyze''': Data analysis can take several forms and should be tailored to the specific ecological project. However, all data analysis methods should be well documented, including the procedure for analysis, the justification for the analysis methods, and any shortcomings of a specific approach.

== Applications of Ecoinformatics Across Ecology ==

=== [[Ecosystem ecology|Ecosystem Ecology]] ===
Source:<ref>{{Cite journal |last1=LaDeau |first1=S. L. |last2=Han |first2=B. A. |last3=Rosi-Marshall |first3=E. J. |last4=Weathers |first4=K. C. |date=2017-03-01 |title=The Next Decade of Big Data in Ecosystem Science |journal=Ecosystems |language=en |volume=20 |issue=2 |pages=274–283 |doi=10.1007/s10021-016-0075-y |issn=1435-0629|doi-access=free |bibcode=2017Ecosy..20..274L }}</ref>

Ecosystem studies, by definition, encompass interactions across the entire life sciences spectrum, from microscopic biochemical reactions to large-scale geological phenomena. As a result, big databases may not be designed specifically for any particular research question, but should be inclusive enough to support most studies. Since ecosystem-level questions require a broad perspective, data-related ecosystem projects would likely incorporate data from several databases.

A common framework for incorporating data into ecosystem-level studies is the [[network science]] model, in which data collection mechanisms and resources are treated as a large, interconnected network instead of individual entities. The network may include several data collection stations within one database, or may span multiple databases. Currently there are several large-scale networks, but they do not yet generate data at the scale needed for ecology to be considered a big data science.
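
As a minimal sketch of how records from two stations or databases might be brought into a common form before analysis, the following Python example uses only the standard library. The station names, column names, units, and values are all hypothetical and chosen for illustration; real networks publish their own schemas, and a production workflow would also carry metadata and quality flags alongside the values.

```python
import csv
import io
from statistics import mean

# Hypothetical exports from two monitoring stations. Column names, units,
# and values are invented for illustration only.
STATION_A_CSV = """date,water_temp_c
2020-06-01,14.0
2020-06-02,15.0
"""

STATION_B_CSV = """Date,TempF
2020-06-01,59.0
2020-06-02,60.8
"""


def read_station_a(text):
    """Station A already reports degrees Celsius; only rename the fields."""
    return [
        {"date": row["date"], "station": "A", "temp_c": float(row["water_temp_c"])}
        for row in csv.DictReader(io.StringIO(text))
    ]


def read_station_b(text):
    """Station B reports degrees Fahrenheit; convert to Celsius."""
    return [
        {
            "date": row["Date"],
            "station": "B",
            "temp_c": (float(row["TempF"]) - 32.0) * 5.0 / 9.0,
        }
        for row in csv.DictReader(io.StringIO(text))
    ]


def daily_mean_temperature(records):
    """Group harmonized records by date and average across stations."""
    by_date = {}
    for record in records:
        by_date.setdefault(record["date"], []).append(record["temp_c"])
    return {date: mean(values) for date, values in sorted(by_date.items())}


# Harmonize both sources into one schema, then analyze them together.
combined = read_station_a(STATION_A_CSV) + read_station_b(STATION_B_CSV)
daily_means = daily_mean_temperature(combined)

for date, value in daily_means.items():
    print(date, round(value, 2))
```

Keeping one small reader function per source isolates each dataset's naming and unit conventions, so the analysis code only ever sees the shared schema (<code>date</code>, <code>station</code>, <code>temp_c</code>).
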

A current challenge for ecoinformatics in ecosystem ecology is that most funding is prioritized for generating new data rather than maintaining existing data infrastructures. Integrating data across different spatial scales can also be difficult, since each dataset may hold different types of data.

=== [[Urban ecology|Urban Ecology]] ===
Source:<ref>{{Cite journal |last=Yang |first=Jun |date=2020-10-01 |title=Big data and the future of urban ecology: From the concept to results |url=https://doi.org/10.1007/s11430-020-9666-3 |journal=Science China Earth Sciences |language=en |volume=63 |issue=10 |pages=1443–1456 |doi=10.1007/s11430-020-9666-3 |bibcode=2020ScChD..63.1443Y |s2cid=221285047 |issn=1869-1897|url-access=subscription }}</ref>

The current push for [[Smart city|smart cities]] and the integration of [[Wireless sensor network|sensor networks]] into infrastructure has positioned cities as a major source of data for ecological studies. Typical urban ecology questions address the effects of urbanization on the local ecosystem and how to guide future development to promote urban [[biodiversity]]. While sensor networks in cities typically collect environmental data to optimize city processes, they may also be used for ecological initiatives, especially for understanding the complex, multi-layered relationship between cities and their local ecosystems. These data can also be used to better understand the current landscape of cities and to identify avenues for rewilding them. For example, analyzing mobility patterns can identify areas that may lend themselves well to building parks and green spaces. [[Birdwatching|Bird watching]] data can also be used to identify the bird species present in a local area.

=== Infectious Disease ===
Source:<ref>{{Cite journal |last=Kasson |first=Peter M. 
|date=2020-07-20 |title=Infectious Disease Research in the Era of Big Data |journal=Annual Review of Biomedical Data Science |language=en |volume=3 |issue=1 |pages=43–59 |doi=10.1146/annurev-biodatasci-121219-025722 |issn=2574-3414|doi-access=free }}</ref>

Like other disciplines of ecology, [[emerging infectious disease]] and [[epidemiology]] span multiple scales, from understanding the genetics that drive disease trends to large-scale spatiotemporal analyses. As a result, infectious disease studies can incorporate everything from bioinformatics, genetic sequences, and amino acid sequences to environmental observation data. On the micro-scale, these data can be used to predict infectivity and transmissibility, drug resistance, drug candidates, and mutation sites. On the macro-scale, they can be used to identify societal trends or environmental factors that lend themselves to spillover, locations of infection, and practices that cause disease transmission.

== Databases ==
Source:<ref>{{Cite journal |last1=Farley |first1=Scott S |last2=Dawson |first2=Andria |last3=Goring |first3=Simon J |last4=Williams |first4=John W |date=2018-07-18 |title=Situating Ecology as a Big-Data Science: Current Advances, Challenges, and Solutions |journal=BioScience |volume=68 |issue=8 |pages=563–576 |doi=10.1093/biosci/biy068 |issn=0006-3568|doi-access=free }}</ref>

* USGS National Streamflow sensor network
* [[Global Biodiversity Information Facility|GBIF]]
* Neotoma
* Paleobiology database
* European Vegetation Archive
* USDA Forest Inventory Analysis
* [https://www.try-db.org/TryWeb/Database.php TRY]
* [https://bien.nceas.ucsb.edu/bien/ BIEN]
* AmeriFlux
* [https://p2infohouse.org/ref/01/00047/7-09.htm TEAM]
* [[iNaturalist]]
* [[National Ecological Observatory Network|NEON]]
* [[Global Lake Ecological Observatory Network|GLEON]]
* [[Long Term Ecological Research Network|LTER]]
* [https://czo-archive.criticalzone.org/national/ CZO]
* [https://www.tern.org.au/research-resources/ TERN]
* [[South African Environmental Observation Network|SAEON]]

==References==
{{Reflist}}

==External links==
*[http://www.ecoinformatics.org ecoinformatics.org], Online Resource for Managing Ecological Data and Information
*[https://web.archive.org/web/20031222200440/http://ecoinformatics.uvm.edu/ Ecoinformatics Collaboratory], Research links and public wiki for discussion.
*[https://web.archive.org/web/20060831180722/http://ecoinformatics.oregonstate.edu/ Ecoinformatics Education], Ecosystem Informatics at Oregon State University
*[http://www.bui-berlin.de Industrial Environmental Informatics], Industrial Environmental Informatics at HTW-Berlin, University of Applied Sciences
*[https://web.archive.org/web/20080919211629/http://www.waite.adelaide.edu.au/ISEI/ International Society for Ecological Informatics]
*[https://web.archive.org/web/20070514211026/http://www.science.uottawa.ca/~jkerr/ Canadian Facility for Ecoinformatics Research], Ecoinformatics at the University of Ottawa, Canada
*[http://www.nceas.ucsb.edu/ecoinfo Ecoinformatics program at the National Center for Ecological Analysis & Synthesis] {{Webarchive|url=https://web.archive.org/web/20141105003935/https://www.nceas.ucsb.edu/ecoinfo |date=2014-11-05 }}
*[http://www.elsevier.com/wps/find/journaldescription.cws_home/705192/description Ecological Informatics: An International Journal on Computational Ecology and Ecological Data Science]
*[http://www.ecologicaldata.org/ Ecological Data]
*[https://www.nsf.gov/pubs/2007/nsf07601/nsf07601.htm NSF DataNet call for proposals]
*[http://www.dataone.org DataONE]
*[http://www.dataconservancy.org Data Conservancy]
*[http://eco-informatics.engr.oregonstate.edu/], EcoInformatics Summer Institute, an NSF-funded REU site (Research Experience for Undergraduates)

{{Informatics}}
{{Authority control}}

[[Category:Ecology]]
[[Category:Information science]]