TY - JOUR
T1 - A specialized data crawler for cross-laminated timber information resources
AU - Thomas, Ed
AU - Espinoza, Omar
AU - Bora, Rahul
AU - Buehlmann, Urs
N1 - Publisher Copyright:
© Forest Products Society 2020.
PY - 2020
Y1 - 2020
N2 - The Internet is composed of more than 6.2 billion Web pages and grows larger every day. As the number of links and specialty subject areas grows, it becomes ever more difficult to find pertinent information. For some subject areas, special-purpose data crawlers continually search the Internet for specific information; examples include real estate, air travel, auto sales, and others. The use of such special-purpose data crawlers (i.e., targeted crawlers and knowledge databases) also allows the collection and analysis of agricultural and forestry data. Such single-purpose crawlers can search for hundreds of key words and use machine learning to determine if what is found is relevant. In this article, we examine the design and data return of such a specialty knowledge database and crawler system developed to find information related to cross-laminated timber (CLT). Our search engine uses intelligent software to locate and update pertinent references related to CLT as well as to categorize information with respect to common application and interest areas. At the time of this publication, the CLT knowledge database has cataloged nearly 3,000 publications regarding various aspects of CLT.
AB - The Internet is composed of more than 6.2 billion Web pages and grows larger every day. As the number of links and specialty subject areas grows, it becomes ever more difficult to find pertinent information. For some subject areas, special-purpose data crawlers continually search the Internet for specific information; examples include real estate, air travel, auto sales, and others. The use of such special-purpose data crawlers (i.e., targeted crawlers and knowledge databases) also allows the collection and analysis of agricultural and forestry data. Such single-purpose crawlers can search for hundreds of key words and use machine learning to determine if what is found is relevant. In this article, we examine the design and data return of such a specialty knowledge database and crawler system developed to find information related to cross-laminated timber (CLT). Our search engine uses intelligent software to locate and update pertinent references related to CLT as well as to categorize information with respect to common application and interest areas. At the time of this publication, the CLT knowledge database has cataloged nearly 3,000 publications regarding various aspects of CLT.
UR - http://www.scopus.com/inward/record.url?scp=85095767111&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095767111&partnerID=8YFLogxK
U2 - 10.13073/FPJ-D-20-00017
DO - 10.13073/FPJ-D-20-00017
M3 - Article
AN - SCOPUS:85095767111
SN - 0015-7473
VL - 70
SP - 256
EP - 261
JO - Forest Products Journal
JF - Forest Products Journal
IS - 3
ER -