All Authors in scopus First Author Second Author Third Author Title Year Source title Volume Issue Art. No. Page start Page end Page count Cited by Link Affiliations Authors with affiliations Abstract Author Keywords Index Keywords Molecular Sequence Numbers Chemicals/CAS Tradenames Manufacturers Funding Details References Correspondence Address Editors Sponsors Publisher Conference name Conference date Conference location Conference code ISSN ISBN CODEN DOI PubMed ID Language of Original Document Abbreviated Source Title Document Type Source GB search
Thelwall M. Thelwall Extracting macroscopic information from Web links 2001 Journal of the American Society for Information Science and Technology 52 13 1157 1168 108 https://www.scopus.com/inward/record.url?eid=2-s2.0-0035498756&partnerID=40&md5=26b008e57be24121afd78d6eaf8c7da5 School of Computing and Information Technology, University of Wolverhampton, Wulfruna Street, Wolverhampton WV1 1SB, United Kingdom Thelwall, M., School of Computing and Information Technology, University of Wolverhampton, Wulfruna Street, Wolverhampton WV1 1SB, United Kingdom Much has been written about the potential and pitfalls of macroscopic Web-based link analysis, yet there have been no studies that have provided clear statistical evidence that any of the proposed calculations can produce results over large areas of the Web that correlate with phenomena external to the Internet. This article attempts to provide such evidence through an evaluation of Ingwersen's (1998) proposed external Web Impact Factor (WIF) for the original use of the Web: the interlinking of academic research. In particular, it studies the case of the relationship between academic hyperlinks and research activity for universities in Britain, a country chosen for its variety of institutions and the existence of an official government rating exercise for research. 
After reviewing the numerous reasons why link counts may be unreliable, it demonstrates that four different WIFs do, in fact, correlate with the conventional academic research measures. The WIF delivering the greatest correlation with research rankings was the ratio of Web pages with links pointing at research-based pages to faculty numbers. The scarcity of links to electronic academic papers in the data set suggests that, in contrast to citation analysis, this WIF is measuring the reputations of universities and their scholars, rather than the quality of their publications. Evaluation; Information retrieval; Search engines; Websites; Web impact factor (WIF); Web links; Information science Almind, T.C., Ingwersen, P., Informetric analyses on the world wide web: Methodological approaches to "webometrics" (1998) Journal of Documentation, 53 (4), pp. 404-426; Amento, B., Hil, W., Terveen, L., Hix, D., Ju, P., An empirical evaluation of user interfaces for topic management of web sites (1999) CHI 99 Conference Proceedings, pp. 552-559. , New York: Addison Wesley; Anderson, A., No citation analyses please, we're British (1991) Science, 252, p. 639; Bar-Ilan, J., Search engine results over time - A case study on search engine stability (1999) Cybermetrics, 2-3. , http://www.cindoc.csic.es/cybermetrics/articles/v2ilp1.html; Bar-Ilan, J., The Web as an information source on Informetrics? A content analysis (2000) Journal of the American Society for Information Science, 51 (5), pp. 432-443; Bar-Ilan, J., Data collection methods on the Web for informetric purposes - A review and analysis (2001) Scientometrics, 50 (1), pp. 7-32; Berners-Lee, T., Hendler, J., Scientific publishing on the "semantic web" (2001) Nature, 410, pp. 1023-1024; Biddle, J., A citation analysis of the sources and extent of Wesley Mitchell's reputation (1996) History of Political Economy, 28 (2), pp. 137-169; Björneborn, L., Ingwersen, P., Perspectives of webometrics (2001) Scientometrics, 50 (1), pp. 
65-82; Borgman, C.L., Digital libraries and the continuum of scholarly communication (2000) Journal of Documentation, 56 (4), pp. 412-430; Brin, S., Page, L., The anatomy of a large scale hypertextual web search engine (1998) Computer Networks and ISDN Systems, 30 (1-7), pp. 107-117; Broder, A., Kumar, R., Maghoull, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J., Graph structure in the web (2000) Computer Networks, 33 (1-6), pp. 309-320; Case, D.O., Higgins, G.M., How can we investigate citation behaviour? A study of reasons for citing literature in communication (2000) Journal of the American Society for Information Science, 51 (7), pp. 635-645; Chakrabarti, S., Dom, B., Gibson, D., Kleinberg, J., Kumar, S.R., Raghavan, P., Rajagopalan, S., Tomkins, A., (1999) Hypersearching the web, pp. 54-60. , Scientific American, June; Chen, C., Structuring and visualising the World-Wide Web with generalised similarity analysis (1997) Proceedings of the 8th ACM conference on hypertext (Hypertext '97), , http://www.brunel.ac.uk/~cssrccc2/papers/ht97.pdf, April, 1997. Southampton, UK; Chen, C., Newman, J., Newnam, R., Rada, R., How did university departments interweave the Web: A study of connectivity and underlying factors (1998) Interacting with Computers, 10, pp. 353-373; Choo, J., Garcia-Molina, H., The evolution of the Web and implications for an incremental crawler (2000) Proceedings of the 26th VLDB Conference, pp. 200-209. , Cairo, Egypt; Cronin, B., Bibliometrics and beyond: Some thoughts on web-based citation analysis (2001) Journal of Information Science, 27 (1), pp. 1-7; Cronin, B., Hyperauthorship: A postmodern perversion or evidence of a structural shift in scholarly communication practices? (2001) Journal of the American Society for Information Science & Technology, 52 (7); Cronin, B., McKim, G., Science and scholarship on the world wide web: A North American perspective (1996) Journal of Documentation, 52 (2), pp. 
163-171; Cronin, B., Snyder, H.W., Rosenbaum, H., Martinson, A., Callahan, E., Invoked on the web (1998) Journal of the American Society for Information Science, 49 (14), pp. 1319-1328; Davenport, E., Cronin, B., The citation network as a prototype for representing trust in virtual environments (2000) The web of knowledge: A festschrift in honor of Eugene Garfield, pp. 517-534. , B. Cronin & H.B. Atkins (Eds.). Metford, NJ: Information Today Inc. ASIS Monograph Series; Davenport, E., Cronin, B., Who dunnit? Metatags and hyperauthorship Journal of the American Society for Information Science & Technology, , in press; Egghe, L., New informetric aspects of the Internet: Some reflections - Many problems (2000) Journal of Information Science, 26 (5), pp. 329-335; Elkin, J., Law, D., The 1996 research assessment exercise: The library and information management panel (1997) Journal of Librarianship and Information Science, 29 (3), pp. 131-141; (2001) Fifth framework programme, , http://europa.eu.int/comm/research/fp5.html, Accessed 16 February, 2001; Fosmire, M., Yu, S., Free scholarly electronic journals: How good are they? (2000) Issues in Science and Technology Librarianship, , http://www.library.ucsb.edu/istl/00-summer/refereed.html, Summer 2000; Garfield, E., (1979) Citation indexing: Its theory and applications in science, technology and the humanities, , New York: Wiley Interscience; Garfield, E., The impact factor (1994) Current Contents, , http://www.isinet.com/isi/hot/essays/journalcitationreports/7.html, June 20; Gibson, D., Kleinberg, J., Raghavan, P., Inferring web communities from link topology (1998) Hypertext 98: Ninth ACM Conference on Hypertext and Hypermedia, , New York: ACM; Gowrishankar, J., Divakar, P., Baylis, M., Gravenor, M., Kao, R., Sprucing up one's impact factor (two letters to the editor) (1999) Nature, 401, pp. 
321-322; Haas, S.W., Grams, E.S., Readers, authors and page structure: A discussion of four questions arising from a content analysis of web pages (2000) Journal of the American Society for Information Science, 51 (2), pp. 181-192; Harnad, S., Carr, L., Integrating, navigating, and analysing open eprint archives through open citation linking (the OpCit project) (2000) Current Science, 79 (5), pp. 629-638; Harter, S.P., Ford, C.E., Web-based analyses of e-journal impact: Approaches, problems, and issues (2000) Journal of the American Society for Information Science, 51 (13), pp. 1159-1176; Harter, S.P., Taemin, K.P., Impact of prior electronic publication on manuscript consideration policies of scholarly journals (2000) Journal of the American Society for Information Science, 51 (10), pp. 940-948; (1998) An introduction to the work of the Higher Education Funding Council for England, , http://www.hefce.ac.uk/Pubs/HEFCE/1998/98_16.htm; Hernández-Borges, A.A., Macías-Cervi, P., Gaspar-Guardado, M.A., Torres-Álvarez de Arcaya, M.L., Ruiz-Rabaza, A., Jiménez-Sosa, A., Can examination of WWW usage statistics and other indirect quality indicators distinguish the relative quality of medical web sites? (1999) Journal of Medical Internet Research, 1 (1). , http://www.jmir.org/1999/l/el/index.htm; Heydon, A., Najork, M., Mercator: A scalable, extensible Web crawler (1999) World Wide Web, 2, pp. 219-229; Holmes, A., Oppenheim, C., Use of citation analysis to predict the outcome of the 2001 RAE for Unit of Assessment 61: Library and information management (2001) Information Research, 6 (2). , http://www.shef.ac.uk/~is/publications/infres/6-2/paper103.html; Ingwersen, P., The calculation of web impact factors (1998) Journal of Documentation, 54 (2), pp. 236-243; Kelly, B., WebWatch: A survey of links to UK university web sites (2000) Ariadne, 23. 
, http://www.ariadne.ac.uk/issue23/web-watch/; Kim, H.J., Motivations for hyperlinking in scholarly electronic articles: A qualitative study (2000) Journal of the American Society for Information Science, 51 (10), pp. 887-899; Kleinberg, J., Authoritative sources in a hyperlinked environment (1999) Journal of the ACM; Kling, R., McKim, G., Scholarly communication and the continuum of electronic publishing (1999) Journal of the American Society for Information Science, 50 (10), pp. 890-906; Knudsen, I., Haug, G., Kirstein, J., (1999) Trends in learning structures in higher education, , http://www.rks.dk/trends1.htm, Accessed: 7 March 2001; Lawrence, S., Giles, C.L., Accessibility of information on the web (1999) Nature, 400, pp. 107-109; Leydesdorff, L., Curran, M., Mapping university-industry-government relations on the Internet: The construction of indicators for a knowledge-based economy (2000) Cybermetrics, 4. , http://www.cindoc.csic.es/cybermetrics/articles/v4i1p2.html; League tables 2000 (2000) The Times Higher Education Supplement, pp. II-III. , Mayfield University Consultants., April 14; McDonald, S., Stevenson, R.J., Navigation in hyperspace: An evaluation of the effects of navigational tools and subject matter expertise on browsing and information retrieval in hypertext (1998) Interacting with Computers, 10 (2), pp. 129-142; Middleton, I., McConnell, M., Davidson, G., Presenting a model for the structure and content of a university World Wide Web site (1999) Journal of Information Science, 25 (3), pp. 219-227; Oppenheim, C., The correlation between citation counts and the 1992 research assessment exercises ratings for British library and information science departments (1995) Journal of Documentation, 51, pp. 18-27; Oppenheim, C., The correlation between citation counts and the 1992 research assessment exercise ratings for British research in genetics, anatomy and archaeology (1997) Journal of Documentation, 53, pp. 
477-487; Oppenheim, C., Do patent citations count? (2000) The web of knowledge: A festschrift in honor of Eugene Garfield, pp. 405-432. , B. Cronin & H.B. Atkins (Eds.). Metford, NJ: Information Today Inc. ASIS Monograph Series; (1999) Noble's higher education financial yearbook, , Noble; Rousseau, R., Sitations: An exploratory study (1997) Cybermetrics, 1. , http://www.cindoc.csic.es/cybermetrics/articles/vlilp1.html, Scotland: Edinburgh; Rousseau, R., Daily time series of common single word searches in Alta Vista and NorthernLight (1999) Cybermetrics, 2-3. , http://www.cindoc.csic.es/cybermetrics/articles/v2ilp2.html; Smith, A.G., A tale of two web spaces: Comparing sites using Web Impact Factors (1999) Journal of Documentation, 55 (5), pp. 577-592; Snyder, H., Rosenbaum, H., Can search engines be used for web-link analysis? A critical review (1999) Journal of Documentation, 55 (4), pp. 375-384; Steiger, J.H., Tests for comparing elements of a correlation matrix (1980) Psychological Bulletin, 87 (2), pp. 245-251; Thelwall, M., Will MANs and SuperJANET dominate educational technology in the UK? (1999) International Journal of Educational Technology, 1 (1). , http://www.amstat.org/publications/jse/; Thelwall, M., Web impact factors and search engine coverage (2000) Journal of Documentation, 56 (2), pp. 185-189; Thelwall, M., Results from a Web impact factor crawler (2001) Journal of Documentation, 57 (2), pp. 177-191; Thelwall, M., The responsiveness of search engine indexes (2001) Cybermetrics, 5 (1). , http://www.cindoc.csic.es/cybermetrics/articles/v5ilp1.html; Thelwall, M., A Web crawler design for data mining (2001) Journal of Information Science, , in press; Thelwall, M., (2001) Applying multivariate statistical analysis to university web links, , University of Wolverhampton; Thomas, O., Willet, P., Webometric analysis of departments of librarianship and information science (2000) Journal of Information Science, 26 (6), pp. 
421-428; Warner, J., A critical review of the application of citation studies to the research assessment exercises (2000) Journal of Information Science, 26 (6), pp. 453-460. , Includes comment by Oppenheim; Warner, J., (2000) Research assessment and citation analysis, 14 (21), p. 39. , http://www.the-scientist.com/yr2000/oct/opin_001030.html; (1999) Performance, implementation, and design notes, , http://www.w3.org/TR/html4/appendix/notes.html#h-B.4.1.1, World Wide Web Consortium. Accessed February 27 Thelwall, M.; Sch. of Computing and Info. Technol., University of Wolverhampton, Wulfruna Street, Wolverhampton WV1 1SB, United Kingdom; email: m.thelwall@wlv.ac.uk 15322882 AISJB 10.1002/asi.1182 English Journal of the American Society for Information Science and Technology Article Scopus Thelwall Extracting macroscopic information from Web links Journal of the American Society for Information Science and Technology 2001
Vaughan L., Thelwall M. Vaughan Thelwall Search engine coverage bias: Evidence and possible causes 2004 Information Processing and Management 40 4 693 707 86 https://www.scopus.com/inward/record.url?eid=2-s2.0-2942610876&partnerID=40&md5=904002e6a0f0931bb6a0ee3da7dfc069 Fac. of Info. and Media Studies, University of Western Ontario, London, Ont. N6A 5B7, Canada; Sch. of Comp./Information Technology, University of Wolverhampton, 35/49 Lichfield Street, Wolverhampton WV1 1EQ, United Kingdom Vaughan, L., Fac. of Info. and Media Studies, University of Western Ontario, London, Ont. N6A 5B7, Canada; Thelwall, M., Sch. of Comp./Information Technology, University of Wolverhampton, 35/49 Lichfield Street, Wolverhampton WV1 1EQ, United Kingdom Commercial search engines are now playing an increasingly important role in Web information dissemination and access. Of particular interest to business and national governments is whether the big engines have coverage biased towards the US or other countries. 
In our study we tested for national biases in three major search engines and found significant differences in their coverage of commercial Web sites. The US sites were much better covered than the others in the study: sites from China, Taiwan and Singapore. We then examined the possible technical causes of the differences and found that the language of a site does not affect its coverage by search engines. However, the visibility of a site, measured by the number of links to it, affects its chance to be covered by search engines. We conclude that the coverage bias does exist but this is due not to deliberate choices of the search engines but occurs as a natural result of cumulative advantage effects of US sites on the Web. Nevertheless, the bias remains a cause for international concern. © 2003 Elsevier Ltd. All rights reserved. Cross-country comparison; Linguistic factor; Search engine coverage bias Electronic commerce; Government data processing; Information dissemination; International cooperation; Online searching; Problem solving; Websites; Cross-country comparison; Linguistic factors; Online business; Search engine coverage bias; Search engines Albitz, P., Liu, C., (2001) DNS and BIND. 4th Ed., , Sebastopol, CA: O'Reilly; Bar-Ilan, J., Search engine results over time - A case study on search engine stability (1999) Cybermetrics, 2-3. , http://www.cindoc.csic.es/cybermetrics/articles/v2i1p1.html; Bar-Ilan, J., Methods for assessing search engine performance over time (2002) Journal of the American Society for Information Science and Technology, 53 (4), pp. 308-319; (2002) China Criticised for Ban on Google, , http://news.bbc.co.uk/1/hi/technology/2238236.stm, Retrieved November 27, 2002; Bharat, K., Chang, B., Henzinger, M., Ruhl, M., Who links to whom: Mining linkage between web sites (2001) Proceedings IEEE International Conference on Data Mining (ICDM), , http://theory.lcs.mit.edu/~ruhl/papers/2001-icdm.pdf, San Jose, Nov. 2001. 
Retrieved June 9, 2003; Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J., Graph structure in the web (2000) Journal of Computer Networks, 33 (1-6), pp. 309-320; Gordon, M., Pathak, P., Finding information on the world wide web: The retrieval effectiveness of search engines (1999) Information Processing and Management, 35 (2), pp. 141-180; Grefenstette, G., Nioche, J., Estimation of english and non-english language use on the WWW (2000) Proceedings of the RIAO'2000 Conference, , http://133.23.229.11/~ysuzuki/Proceedingsall/RIAO2000/Wednesday/ 20plenary2.pdf, Paris: C.I.D. Retrieved December 20, 2001; Hawking, D., Craswell, N., Bailey, P., Griffiths, K., Measuring search engine quality (2001) Information Retrieval, 4 (1), pp. 33-59; Henzinger, M.R., Heydon, A., Mitzenmacher, M., Najork, M., Measuring index quality using random walks on the web (1999) Computer Networks and ISDN Systems, 31 (11-16), pp. 1291-1303; Howell, D., (2002) Statistical Methods for Psychology. 5th Ed., , Pacific Grove, CA, USA: Duxbury; Huberman, B.A., Adamic, L.A., Growth dynamics of the world wide web (1999) Nature, 401, p. 131; Introna, L.D., Nissenbaum, H., Shaping the web: Why the politics of search engines matters (2000) The Information Society, 16, pp. 169-185; Ju-Pak, K.H., Content dimensions of web advertising: A cross-national comparison (1999) International Journal of Advertising, 18 (2), pp. 207-231; Judd, C.M., Mcclelland, G.H., (1989) Data Analysis: A Model-comparison Approach, , San Diego, USA: Harcourt Brace Jovanovich; Lawrence, S., Giles, C.L., Accessibility of information on the web (1999) Nature, 400, pp. 107-109; Mettrop, W., Nieuwenhuysen, P., Internet search engines - Fluctuations in document accessibility (2001) Journal of Documentation, 57 (5), pp. 
623-651; Moukdad, H., Language-based retrieval of web documents: An analysis of arabic-recognition capabilities of two major search engines (2002) Proceedings of the 65th ASIST Annual Meeting, 39, p. 551. , ASIST 2002; Mowshowitz, A., Kawaguchi, A., Assessing bias in search engines (2002) Information Processing and Management, 38 (1), pp. 141-156; Nie, J.Y., Ren, F., Chinese information retrieval: Using characters or words? (1999) Information Processing and Management, 35 (4), pp. 443-462; Notess, G., (2002) Search Engine Statistics: Relative Size Showdown, , http://www.searchengineshowdown.com/stats/size.shtml, Retrieved August 1, 2002; (2002) Country and Language, , http://wcp.oclc.org/, Retrieved September 4, 2002; O'Neill, E.T., McClain, P.D., Lavoie, B.F., (1997) A Methodology for Sampling the World Wide Web, , http://www.oclc.org/research/publications/arr/1997/oneill/ o%27neillar980213.htm, Retrieved December 2, 2002; Oyama, K., Ishida, E., Kando, N., (2003) NTCIR Workshop3: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, , http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings3/index.html, Retrieved June 9, 2003; Peters, C., Cross-language information retrieval and evaluation: Workshop of the cross-language evaluation forum, clef 2000 (2001) Lecture Notes in Computer Science, 2069. , Lisbon, Portugal, September 21-22, 2000: revised papers. Berlin: Springer; Rousseau, R., Situations: An exploratory study (1997) Cybermetrics, 1. , http://www.cindoc.csic.es/cybermetrics/articles/v1i1p1.html, Retrieved December 2, 2002; Rousseau, R., Daily time series of common single word searches in AltaVista and NorthernLight (1999) Cybermetrics, 2-3. 
, http://www.cindoc.csic.es/cybermetrics/articles/v2i1p2.html, Retrieved December 2, 2002; Selberg, E., Etzioni, O., On the instability of web search engines (2000) Proceedings RIAO, , http://citeseer.nj.nec.com/selberg00instability.html, Paris. April 2000. PDF file retrieved June 9, 2003; Sullivan, D., (2001) AltaVista Regional Listings Left to Rot, , http://searchenginewatch.com/sereport/01/09-altavista.html, Retrieved November 27, 2002; Thelwall, M., Commercial web sites: Lost in cyberspace? (2000) Internet Research: Electronic Networking and Applications, 10 (2), pp. 150-159; Thelwall, M., A web crawler design for data mining (2001) Journal of Information Science, 27 (5), pp. 319-325; Thelwall, M., (2001) A Publicly Accessible Database of UK University Website Links and a Discussion of the Need for Human Intervention in Web Crawling, , http://www.scit.wlv.ac.uk/~cm1993/papers/a_publicly_accessible_database. pdf, Retrieved January 14, 2003; Thelwall, M., The responsiveness of search engine indexes (2001) Cybermetrics, 5 (1). , http://www.cindoc.csic.es/cybermetrics/articles/v5i1p1.html, Retrieved January 14, 2003; Thelwall, M., Methodologies for crawler based web surveys (2002) Internet Research: Electronic Networking and Applications, 12 (2), pp. 124-138; Thelwall, M., Evidence for the existence of geographic trends in university web site interlinking (2002) Journal of Documentation, 58 (5), pp. 563-574; Thelwall, M., Smith, A., A study of the interlinking between Asia-Pacific university web sites (2002) Scientometrics, 55 (3), pp. 363-376; Thelwall, M., Wilkinson, D., Graph structure in three national academic webs: Power laws with anomalies (2003) Journal of the American Society for Information Science and Technology, 54 (8), pp. 
706-712; Vaughan, L., New measurements for search engine evaluation proposed and tested (2003) Information Processing and Management, , in press; Vaughan, L., Thelwall, M., Scholarly use of the web: What are the key inducers of links to journal web sites? (2003) Journal of the American Society for Information Science and Technology, 54 (1), pp. 29-38; Walker, J., Links and power: The political economy of linking on the web (2002) Proceedings of ACM Hypertext 2002, pp. 72-73 Vaughan, L.; Fac. of Info. and Media Studies, University of Western Ontario, London, Ont. N6A 5B7, Canada; email: lvaughan@uwo.ca 3064573 IPMAD 10.1016/S0306-4573(03)00063-3 English Information Processing and Management Article Scopus Vaughan Thelwall Search engine coverage bias: Evidence and possible causes Information Processing and Management 2004
Vaughan L., Thelwall M. Vaughan Thelwall Scholarly use of the web: What are the key inducers of links to journal web sites? 2003 Journal of the American Society for Information Science and Technology 54 1 29 38 82 https://www.scopus.com/inward/record.url?eid=2-s2.0-0037224148&partnerID=40&md5=2111d87e4a603c4f663d9331f3c30bf3 Faculty of Info. and Media Studies, University of Western Ontario, London, Ont. N6A 5B7, Canada; Sch. of Computing and Info. Technol., University of Wolverhampton, 35/49 Lichfield Street, Wolverhampton WV1 1EQ, United Kingdom Vaughan, L., Faculty of Info. and Media Studies, University of Western Ontario, London, Ont. N6A 5B7, Canada; Thelwall, M., Sch. of Computing and Info. Technol., University of Wolverhampton, 35/49 Lichfield Street, Wolverhampton WV1 1EQ, United Kingdom Web links have been studied by information scientists for at least six years but it is only in the past two that clear evidence has emerged to show that counts of links to scholarly Web spaces (universities and departments) can correlate significantly with research measures, giving some credence to their use for the investigation of scholarly communication. 
This paper reports on a study to investigate the factors that influence the creation of links to journal Web sites. An empirical approach is used: collecting data and testing for significant patterns. The specific questions addressed are whether site age and site content are inducers of links to a journal's Web site as measured by the ratio of link counts to Journal Impact Factors, two variables previously discovered to be related. A new methodology for data collection is also introduced that uses the Internet Archive to obtain an earliest known creation date for Web sites. The results show that both site age and site content are significant factors for the disciplines studied: library and information science, and law. Comparisons between the two fields also show disciplinary differences in Web site characteristics. Scholars and publishers should be particularly aware that richer content on a journal's Web site tends to generate links and thus the traffic to the site. Data acquisition; Digital libraries; Education; Information science; Laws and legislation; Publishing; Search engines; Telecommunication traffic; Websites; Academia; Departments; Internet archive; Journal web sites; Scholarly communication; Universities; Web links; Information retrieval Adam, D., The counting house (2002) Nature, 415, pp. 726-729; Almind, T.C., Ingwersen, P., Informetric analysis on the world wide web: Methodological approaches to webometrics (1997) Journal of Documentation, 53 (4), pp. 404-426; AltaVista advanced search tutorial-link popularity (2002), http://help.altavista.com/adv_search/ast_haw_popularity, AltaVista; (25 February, 2002); Bar-Ilan, J., Data collection methods on the web for informetric purposes-a review and analysis (2001) Scientometrics, 50 (1), pp. 7-32; Björneborn, L., Ingwersen, P., Perspectives of webometrics (2001) Scientometrics, 50 (1), pp. 
65-82; Brin, S., Page, L., The anatomy of a large scale hypertextual web search engine (1998) Computer Networks and ISDN Systems, 30 (1-7), pp. 107-117. , http://citeseer.nj.nec.com/brin98anatomy.html; Chavez-Demoulin, V.C., Roehrl, A.S.A., Roehrl, R.A., Weinberg, A., The WEB archives: A time machine in your pocket (2000), http://cite-seer.nj.nec.com/chavez-demoulin99web.html, Internet Archive Colloquium, San Francisco, March 2000; Chu, H., He, S., Thelwall, M., Library and information science schools in Canada and USA: A webometric perspective Journal of Education for Library and Information Science, , (in press); Cole, J.R., A short history of the use of citations as a measure of the impact of scientific and scholarly work (2000) The Web of Knowledge: A Festschrift in Honor of Eugene Garfield, pp. 281-300. , In: Cronin, B., & Atkins, H.B. (eds.); Medford, NJ: Information Today Inc. ASIS Monograph Series; Cronin, B., Bibliometrics and beyond: Some thoughts on web-based citation analysis (2001) Journal of Information Science, 27 (1), pp. 1-7; Davenport, E., Cronin, B., The citation network as a prototype for representing trust in virtual environments (2000) The Web of Knowledge: A Festschrift in Honor of Eugene Garfield, pp. 517-534. , In: Cronin, B., & Atkins, H.B. (eds.); Medford, NJ: Information Today Inc. ASIS Monograph Series; Garfield, E., The impact factor (1994) Current Contents, , http://www.isinet.com/isi/hot/essays/journalcitationreports/7.html, June 20; Goodrum, A.A., McCain, K.W., Lawrence, S., Giles, C.L., Scholarly publishing in the internet age: A citation analysis of computer science literature (2001) Information Processing & Management, 37 (5), pp. 661-676; Harter, S., Ford, C., Web-based analysis of E-journal impact: Approaches, problems, and issues (2000) Journal of the American Society for Information Science, 51 (13), pp. 
1159-1176; Howell, D., (2002) Statistical Methods for Psychology, 5th Ed., , Pacific Grove, CA: Duxbury; Ingwersen, P., The calculation of web impact factors (1998) Journal of Documentation, 54 (2), pp. 236-243; Kim, H.J., Motivations for hyperlinking in scholarly electronic articles: A qualitative study (2000) Journal of the American Society for Information Science, 51 (10), pp. 887-899; Kling, R., McKim, G., Scholarly communication and the continuum of electronic publishing (1999) Journal of the American Society for Information Science, 50 (10), pp. 890-906; Kling, R., McKim, G., Not just a matter of time: Field differences in the shaping of electronic media in supporting scientific communication (2000) Journal of the American Society for Information Science, 51 (14), pp. 1306-1320; Koman, R., How the wayback machine works (2002), http://www.oreillynet.com/pub/a/webservices/2002/01/18/brewster.html, (7 February, 2002); Larson, R.R., Bibliometrics of the world wide web: An exploratory analysis of the intellectual structure of cyberspace (1996) ASIS 96, , http://sherlock.berkeley.edu/asis96.html, (4 August 2001); Lawrence, S.L., Online or invisible? (2001) Nature, 411 (6837), p. 521; Lawrence, S., Giles, C.L., Accessibility of information on the web (1999) Nature, 400, pp. 107-109; Leydesdorff, L., Curran, M., Mapping university-industry-government relations on the internet: The construction of indicators for a knowledge-based economy (2000) Cybermetrics, 4. , http://www.cindoc.csic.es/cybermetrics/articles/v4ilp2.html; Moed, H.F., The impact-factors debate: The ISI's uses and limits (2002) Nature, 415, pp. 731-732; (2002) Northern Light General Help: Webmaster FAQs, , http://www.northernlight.com/docs/gen_help_faq_webmaster.html#rank, NorthernLight; (25 February, 2002); Rousseau, R., Sitations, an exploratory study (1997) Cybermetrics, 1. 
, http://www.cindoc.csic.es/cybermetrics/articles/vlilpl.html; Rousseau, R., Daily time series of common single word searches in Alta Vista and Northern Light (1998) Cybermetrics, 2-3 (1). , http://www.cindoc.csic.es/cybermetrics/articles/v2ilp2.html; Sandvik, H., Health information and interaction on the internet: A survey of female urinary incontinence (1999) British Medical Journal, 319 (7201), pp. 29-32; Smith, A.G., A tale of two web spaces: Comparing sites using web impact factors (1999) Journal of Documentation, 55 (5), pp. 577-592; Smith, A., Thelwall, M., Web impact factors for Australasian universities (2002) Scientometrics, 54 (1-2), pp. 363-380; Snyder, H., Rosenbaum, H., Can search engines be used as tools for Web-link analysis? A critical view (1999) Journal of Documentation, 55 (4), pp. 375-384; Soualmia, L.F., Darmoni, S.J., Le Duff, F., Douyére, M., Thelwall, M., Web impact factor: A bibliometric criterion applied to medical informatics societies' web sites (2002) Proceedings of MIE 2002. Seventeenth International Congress of the European Federation for Medical Informatics, Studies in Health Technology & Informatics, 90, pp. 178-183; Spink, A., Introduction to the special issue on web research (2002) Journal of the American Society for Information Science and Technology, 53 (2), pp. 65-66; Thelwall, M., Web impact factors and search engine coverage (2000) Journal of Documentation, 56 (2), pp. 185-189; Thelwall, M., Results from a web impact factor crawler (2001) Journal of Documentation, 57 (2), pp. 177-191; Thelwall, M., The responsiveness of search engine indexes (2001) Cybermetrics, 5 (1). , http://www.cindoc.csic.es/cybermetrics/articles/v5ilpl.html; Thelwall, M., Extracting macroscopic information from web links (2001) Journal of the American Society for Information Science and Technology, 52 (13), pp. 
1157-1168; Thelwall, M., Methodologies for crawler-based web surveys (2002) Internet Research: Electronic Networking and Applications, 12 (2), pp. 124-138; Thelwall, M., A comparison of sources of links for academic web impact factor calculations (2002) Journal of Documentation, 58, pp. 60-72; Thelwall, M., Conceptualizing documentation on the web: An evaluation of different heuristic-based models for counting links between university web sites Journal of the American Society for Information Science and Technology, , (In press, b); Thomas, O., Willett, P., Webometric analysis of departments of librarianship and information science (2000) Journal of Information Science, 26 (6), pp. 421-428; Vaughan, L., Statistical methods for the information professional: A practical, painless approach to understanding, using, and interpreting statistics (2001), (ASIST Monograph Series), Medford, New Jersey: Information Today, Inc.; Vaughan, L., Hysen, K., The impact of journal websites ASLIB Proceedings: New Information Perspectives, , (in press); Vaughan, L., Thelwall, M., Web link counts correlate with ISI impact factors: Evidence from two disciplines Proceedings of the Annual Conference of the American Society for Information Science and Technology, to be Held in Philadelphia, U.S.A., November 18-21, 2002, , (In press) Vaughan, L.; Faculty of Info. and Media Studies, University of Western Ontario, London, Ont. N6A 5B7, Canada; email: Ivaughan@uwo.ca 15322882 AISJB 10.1002/asi.10184 English Journal of the American Society for Information Science and Technology Article Scopus Vaughan Thelwall Scholarly use of the web: What are the key inducers of links to journal web sites? Journal of the American Society for Information Science and Technology 2003 Wilkinson D., Harries G., Thelwall M., Price L. 
Wilkinson Harries Thelwall Motivations for academic web site interlinking: Evidence for the web as a novel source of information on informal scholarly communication 2003 Journal of Information Science 29 1 49 56 82 https://www.scopus.com/inward/record.url?eid=2-s2.0-0037282197&partnerID=40&md5=c87ca2be15e49f095eaaf3783da8b8f8 Sch. of Computing and Info. Technol., University of Wolverhampton, Wulfruna Street, Wolverhampton, WV1 1SB, United Kingdom Wilkinson, D., Sch. of Computing and Info. Technol., University of Wolverhampton, Wulfruna Street, Wolverhampton, WV1 1SB, United Kingdom; Harries, G., Sch. of Computing and Info. Technol., University of Wolverhampton, Wulfruna Street, Wolverhampton, WV1 1SB, United Kingdom; Thelwall, M., Sch. of Computing and Info. Technol., University of Wolverhampton, Wulfruna Street, Wolverhampton, WV1 1SB, United Kingdom; Price, L., Sch. of Computing and Info. Technol., University of Wolverhampton, Wulfruna Street, Wolverhampton, WV1 1SB, United Kingdom The need to understand authors' motivations for creating links between university web sites is addressed by a survey of a random collection of 414 such links from the ac.uk domain. A classification scheme was created and applied to this collection. Obtaining inter-classifier agreement as to the single main link creation cause was very difficult because of multiple potential motivations and the fluidity of genre on the Web. Nevertheless, it was clear that, whilst the vast majority, over 90%, was created for broadly scholarly reasons, only two were equivalent to journal citations. It is concluded that academic web link metrics will be dominated by a range of informal types of scholarly communication. Since formal communication can be extensively studied through citation analysis, this provides an exciting new window through which to investigate a facet of a previously obscured type of communication activity. 
Engineering education; Information analysis; Societies and institutions; Surveys; Telecommunication links; Websites; Academic web site; Citation analysis; Informal scholarly communication; Inter-classifier agreement; Interlinking; Information use Tague-Sutcliffe, J., An introduction to informetrics (1992) Information Processing & Management, 28 (1), pp. 1-3; Cronin, B., (1984) The Citation Process: The Role and Significance of Citations in Scientific Communication, , (London: Taylor Graham); Borgman, C., Furner, J., Scholarly communication and bibliometrics (2002) Annual Review of Information Science and Technology, 36, pp. 3-72. , In: Cronin, B. (ed.); (Information Today, Medford, NJ); Cronin, B., Bibliometrics and beyond: Some thoughts on web-based citation analysis (2001) Journal of Information Science, 27 (1), pp. 1-7; Sloan, B., (2001) Personal Citation Index: Exploring the Impact of Selected Papers, , www.lis.uiuc.edu/-b-sloan/pci2.html, (access date 18 June 2002); Sloan, B., (2001) Personal Citation Index, (74). , http://listserv.utk.edu/cgi-bin/wa?A2=ind0111&L=jesse&F=&S= &P=8021, JESSE archives November; Wilson, T., (2002) Web Citation, (6). , http://listserv.utk.edu/cgi-bin/wa?A2=ind0205&L=jesse&F=&S= &P=720, JESSE discussion list. JESSE Archives May; Peterson, I., Touring the scientific web (2001) Science Communication, 22 (3), pp. 246-255; Trumbo, C.W., Sprecker, K.J., Dumlao, R.J., Yu, G.W., Duke, S., Use of e-mail and the web by science writers (2001) Science Communication, 22 (4), pp. 347-378; Davenport, E., Cronin, B., The citation network as a prototype for representing trust in virtual environments (2000) The Web of Knowledge: A Festschrift in Honor of Eugene Garfield. ASIS Monograph Series, pp. 517-534. , In: B. Cronin and H.B. 
Atkins (eds.); (Information Today Inc., Medford, NJ); Cronin, B., Shaw, D., Banking (on) different forms of symbolic capital Journal of the American Society for Information Science and Technology, , (in press); Thelwall, M., Results from a web impact factor crawler (2001) Journal of Documentation, 57 (2), pp. 177-191; Thomas, O., Willett, P., Webometric analysis of departments of librarianship and information science (2000) Journal of Information Science, 26 (6), pp. 421-428; Chu, H., He, S., Thelwall, M., Library and information science schools in Canada and USA: A Webometric perspective (2002) Journal of Education for Library and Information Science, 43 (2), pp. 110-125; Smith, A., Thelwall, M., Web impact factors for Australasian universities (2002) Scientometrics, 54 (1-2), pp. 363-380; Tang, R., Thelwall, M., Exploring the pattern of links between Chinese university web sites (2002) Proceedings of the ASIST Annual Meeting (ASIST 2002), 39, pp. 417-424; Thelwall, M., Extracting macroscopic information from web links (2001) Journal of the American Society for Information Science and Technology, 52 (13), pp. 1157-1168; Thelwall, M., Conceptualizing documentation on the Web: An evaluation of different heuristic-based models for counting links between university web sites (2002) Journal of the American Society for Information Science and Technology, 53 (12), pp. 995-1005; Thelwall, M., A research and institutional size based model for national university web site interlinking (2002) Journal of Documentation, 58 (6), pp. 683-694; Thelwall, M., Evidence for the existence of geographic trends in university web site interlinking (2002) Journal of Documentation, 58 (5), pp. 563-574; Thelwall, M., Web use and peer interconnectivity metrics for academic Web sites (2003) Journal of Information Science, 29 (1), pp. 1-10; Rousseau, R., Sitations, an exploratory study (1997) Cybermetrics, 1. 
, www.cindoc.csic.es/cynbemetrics/articles/v1i1p1.html; (2002) Electronic Journal of Scientometrics, Informetrics and Bibliometrics, , www.cindoc.csic.es/cybermetrics/cybermetrics.html, Cybermetrics; Brin, S., Page, L., The anatomy of a large scale hypertextual web search engine (1998) Computer Networks and ISDN Systems, 30 (1-7), pp. 107-117; Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A., Raghavan, S., Searching the web (2001) ACM Transactions on Internet Technology, 1 (1), pp. 2-43; Bar-Ilan, J., Data collection methods on the Web for informetric purposes-a review and analysis (2001) Scientometrics, 50 (1), pp. 7-32; Björneborn, L., Ingwersen, P., Perspectives of webometrics (2001) Scientometrics, 50 (1), pp. 65-82; Van Raan, A.F.J., Bibliometrics and internet: Some observations and expectations (2001) Scientometrics, 50 (1), pp. 59-63; Ingwersen, P., The calculation of web impact factors (1998) Journal of Documentation, 54, pp. 236-243; Bar-Ilan, J., Search engine results over time - A case study on search engine stability (1999) Cybermetrics, 2-3. , www.cindoc.csic.es/cybermetrics/articles/v2i1p1.html; Rousseau, R., Daily time series of common single word searches in Altavista and northernlight (1999) Cybermetrics, 2-3. , www.cindoc.csic.es/cynbermetrics/articles/v2i1p2.html; Thelwall, M., The responsiveness of search engine indexes (2001) Cybermetrics, 5 (1). , www.cindoc.csic.es/cybermetrics/articles/v5i1p1.html; Thelwall, M., A web crawler design for data mining (2001) Journal of Information Science, 27 (5), pp. 319-325; Moed, H.F., The impact-factors debate: The ISI's uses and limits (2002) Nature, 415, pp. 731-732; Kim, H.J., Motivations for hyperlinking in scholarly electronic articles: A qualitative study (2000) Journal of the American Society for Information Science, 51 (10), pp. 
887-899; Cronin, B., Snyder, H.W., Rosenbaum, H., Martinson, A., Callahan, E., Invoked on the web (1998) Journal of the American Society for Information Science, 49 (14), pp. 1319-1328; Thelwall, M., The top 100 linked pages on UK university Web sites: High backlink counts are not usually directly associated with quality scholarly content (2002) Journal of Information Science, 28 (6), pp. 485-493; Kling, R., McKim, G., Not just a matter of time: Field differences in the shaping of electronic media in supporting scientific communication (2000) Journal of the American Society for Information Science, 51 (14), pp. 1306-1320; Crowston, K., Williams, M., Reproduced and emergent genres of communication in the world wide web (2000) Information Society, 16 (3), pp. 201-215; Haas, S.W., Grams, E.S., Readers, authors and page structure: A discussion of four questions arising from a content analysis of web pages (2000) Journal of the American Society for Information Science, 51 (2), pp. 181-192; Rehm, G., Towards automatic web genre identification - A corpus-based approach in the domain of academia by example of the academic's personal homepage Proceedings of the Hawaii International Conference on System Sciences (Big Island, Hawaii, 7-10 January, 2002).; Brooks, T.A., Private acts and public objects: An investigation of citer motivations (1985) Journal of the American Society for Information Science, 36 (4), pp. 181-192; White, M.D., Wang, P., A qualitative study of citing behavior: Contributions, criteria, and metalevel documentation concerns (1997) Library Quarterly, 67 (2), pp. 122-154; Oppenheim, C., Renn, S.P., Highly cited old papers and reasons why they continue to be cited (1978) Journal of the American Society for Information Science, 29 (5), pp. 225-231; Moravcsik, M.J., Murugesan, P., Some results on the function and quality of citations (1975) Social Studies of Science, 5 (1), pp. 
86-92; Chubin, D.E., Moitra, S.D., Content analysis of references: Adjunct or alternative to citation analysis? (1975) Social Studies of Science, 5 (1), pp. 86-92; Thelwall, M., Methodologies for crawler based web surveys (2002) Internet Research: Electronic Networking and Applications, 12 (2), pp. 124-138; Thelwall, M., (2001) A Publicly Accessible Database of UK University Website Links and a Discussion of the Need for Human Intervention in Web Crawling, , www.scit.wlv.ac.uk/-cm1993/papers/a_publicly_accessible_dat abase.pdf; Cooper, W.S., Is interindexer consistency a hobgoblin? (1969) American Documentation, 20, pp. 268-278; Thelwall, M., Will MANs and SuperJANET dominate educational technology in the UK? (1999) International Journal of Educational Technology, 1 (1). , www.outreach.uiuc.edu/ijet/v1n1/thelwall// Wilkinson, D.; Sch. of Computing and Info. Technol., University of Wolverhampton, Wulfruna Street, Wolverhampton, WV1 1SB, United Kingdom; email: D.Wilkinson@wlv.ac.uk 1655515 JISCD English Journal of Information Science Article Scopus Wilkinson Harries Thelwall Motivations for academic web site interlinking: Evidence for the web as a novel source of information on informal scholarly communication Journal of Information Science 2003