Home Page
 About the Branch
 Special Interest Groups
 Events
 News
 Small Grants Fund
 Publications
 Feature Archive
 Mailing List
 Useful Links
 Find a Job
 Anglia Search
 
 Site Search
 

 

 Home > Archive > Digging for Victory: Finding Difficult Information

"Digging for Victory: Finding Difficult Information": a review of the East of England ISG workshop held 26th April 2006.

Candace Guite,College Librarian, Christ's College, Cambridge

This was an all-day workshop held in Norwich's new flagship Archive Centre. The entire collection has recently received designated status from the MLA and, during the lunch-time break, delegates were able to view a selection of the Archive's treasures on display.

The day was aimed at all sectors of the ISG membership base with librarians attending from public, academic, local government and special libraries and it provided plenty of opportunities for networking in between four excellent talks and the Group's AGM.  All speakers rose magnificently to the occasion when persistent technical glitches resulted in the failure of all internet connections but fortunately none of the substance of the talks was affected. The theme of the day "Finding difficult information" focussed particularly on the challenges of extracting the precise and the relevant from the plethora of information available online. Phil Purdy, Development Officer of the Museum, Libraries and Archives Council, gave us a practical working insight into recent progress on the People's Network and demonstrated how the provision of a high-quality portal to web resources can bridge the concept of the traditional library and the library without walls. Terry Kendrick, currently a freelance business consultant, considered the 'deep' or invisible web and looked at ways in which users can access these hidden resources. Alastair Allan, Senior Liaison Officer, University of Sheffield Library, looked at the complex area of e-Government, and in particular at the problems that can arise with diverse publishers and a bewildering variety of published formats. Jane Mortimer in her role as Academic Librarian: Resources and Services, at De Montfort University, considered support for the virtual user, taking as an example the students on the De Montfort distance learning programme.

The People's Network Enquire service

Phil Purdy gave us a detailed look at 'Enquire', the online reference service for members of the public launched in October 2005.'Enquire' (http://www.peoplesnetwork.gov.uk/enquire/index.html) is one strand within an overarching strategy (Enquire, Discover and Read) designed to ensure that members of the public are given the practical resources and the 'know-how' to navigate through and derive the best advantages from all e-Britain has to offer. 'Discover' (http://www.peoplesnetwork.gov.uk/discover/) provides a single portal to news, trusted web sites, and gives access to collections of digitised images from museums, libraries and archives. 'Discover' uses open-source software, allowing users to personalise the portal to meet their particular requirements and interests, and facilitates communication and the sharing of resources. 'Read' (http://www.peoplesnetwork.gov.uk/read/index.html) is underpinned by, but goes beyond, the basic literacy agenda, with a remit to encourage the exchange of ideas amongst readers. Users can contribute their own online reviews, and access reviews contributed by others. They can also link to online reading groups or contact a reading group local to where they live. All three interlinking strands of the People's Network are managed by the MLA, and are lottery funded at present. Future costs may devolve down to local authorities via a subscription scheme but the economies of scale should ensure that costs are kept low and that the service provides good value for money. A new and refreshing element, and vital to the success of the project, has been the close partnership achieved with UK and North American libraries, in order to deliver Enquire's national 24/7 service.

The new service is not intended to replace existing traditional reference services where the user approaches the reference enquiry desk to ask a question. Rather it should be regarded as a complementary value-added service, and because it is delivered through libraries has the public's traditional trust in the quality and professionalism of the service provided.

The value-added component includes more than just the 24/7 element. The powerful software OCLC-PICA (QuestionPoint) which underwrites 'Enquire' facilitates the pooling of expertise, a very useful feature when particularly difficult and complex enquiries arise. It also allows the librarians who staff the service to draw on a very wide range of regional, national and international resources, and then to link these, where appropriate, to local resources available to the user.

It appears though that the public sometimes labours under a misconception that 'Enquire' functions as a remote service, with a time delayed response but it is hoped that as users become more familiar with the service this will change. 'Enquire' does just what it says it does - http://www.peoplesnetwork.gov.uk/ - and provides a service which connects to a real librarian in real time. But your friendly helpful online librarian just might be in America or Canada if your question comes at 2 am.

The service is staffed on a rota system mostly by English public libraries (98 library authorities in England representing about 65% of public library authorities) between 9 am and 7 pm. Phil explained that the MLA hopes that this figure will go up to represent 100% and that in future academic libraries will be included too. Outside of English office hours it is staffed by North American libraries. 'Enquire' currently deals with about 60 enquiries a day, 72% of which are answered by English public libraries and 28% answered by international partners. Most of the enquiries received are answered using online resources, and feedback from users has been very positive to date. Two authorities, Kent and Essex, have pioneered 'Enquire' as a local service sitting alongside the national resource. It is staffed by their librarians during office hours and then switched over to the national resource out of office hours.

To download Phil's presentation click on the icon here

Information on the invisible and deep web

Terry Kendrick gave an inspired presentation on the invisible, hidden or 'deep' web. Terry's background in public libraries (he clearly relished his earlier incarnation as a reference librarian) has influenced his down-to-earth and practical approach to the problem. In Terry's estimate, only 6% of what is on the web is visible. It is an indication that you are entering the deep or invisible web when the WWW disappears from the top of your screen.  Terry suggested that if we consider the capabilities of just one search engine, the ever-popular Google for example, it can, at the most, spider off 10% of the web.

Terry went on to pose a number of 'What ifs?'. For example, what if the Google spider visits the site but goes away before searching the whole site? The trawl would then retrieve only some ofthe pages or what is just as bad, only parts of pages. What if the author of the web pages scanned by the spider had used a "no robots" command? Terry again drew from his own experience to give us an example with some of his PhD research (the application of risk management tools and techniques to strategic marketing planning) which is posted on the web. There are benefits in sharing research and in stimulating debate but clearly a web author would also want to exercise control over when and how that access happens. What if the material was put up last week or operates in "real time" and is dynamically generated?  What if the spelling is wrong? Some search engines suggest corrections and offer alternatives but not all.

  So for the user there is a real danger of bypassing just the kind of high quality resources likely to contain the answers to the questions posed, e.g. databases, library catalogues, private, and password-protected commercial and academic sites, picture collections and "mash-ups" ("mash-ups" combine programmable technology mashed up with a database). Terry's example of a mash-up was chicagocrime.org which links the Chicago Police Department's crime records to Google maps, allowing the user to make very detailed and precise searches, e.g. homicides on 45th St. The nature of much of this embedded and graphic material is simply not amenable to a conventional single engine search strategy approach. To balance this, new technologies capable of revealing parts of the hidden web are constantly evolving, for example, 'Bittorent' (this file-sharing software that facilitates the fast movement of files has all sorts of applications).

Terry gave us another example from PowerPoint. The notes from a PowerPoint presentation are embedded information but by selecting 'edit' slides the user can gain access to the notes pages, and thereby retrieve potentially useful and possibly commercially sensitive information. The slides notes are there, of course, to act as a prompt for the speaker but often what remains unsaid can be equally revealing.

Google is text driven and does the best it can but it might bypass an essential database. Terry gave us a scenario using Companies House and his own company as an example. For a typical business-related quick reference query, where the user wants to establish the date of incorporation of a company, it is best to go directly to the single most authoritative source, i.e. the Companies House website. This is a fast direct method that yields accurate information. But what happens if the user chooses to conduct a 'Terry Kendrick' Google search? An experienced user might well be aware, that different search engines yield different results according to the search algorithms deployed. There can be an overlap as low as 30% between two different search engines, so information which appears to be hidden may be so purely due to the limitations of the search engines used. Our user might know about the danger of collapsed sites (the phenomenon whereby a significant proportion of retrieved hits simply does not display), and is probably aware of the problems inherent in language e.g. in variant spelling forms, or of the issues that arise vis-á-vis the use of natural language vs. technical terminology. But assessing the accuracy, relevance and authority of the retrieved items can be more difficult. In the case of our Google 'Terry Kendrick' query, the user would be sadly led astray by one of the retrieved items, an interview with Terry Kendrick, in which the date given is wrong by a year or so: a mistake explained by a simple lapse of memory on Terry's part.

Our user might be less aware that when he performs a Google search the results are retrieved from a cache. How and when, that cached information is indexed and updated also clearly affects the outcome of the search. So what can the user do to find the information he needs? To view the scope and range of resources available, Terry suggested the use of lists e.g. 'Complete Planet' a directory of databases; 'Gary Price's Direct', links to hidden resources, and subject gateways. A good subject gateway will reveal the hidden web (e.g. databases) as well as the visible resources. Additionally the user with a good knowledge base of the subject, and one who can make an imaginative leap and visualise how the database layout attempts to map that subject clearly has the advantage. A final plea from Terry: "Don't give your brain away to a search engine!"

Searching for information on government web-sites

Alastair Allan helped unravel the complex maze of e-Government information.E-Government publication is all about making that information available in some kind of electronic format. There are huge potential advantages in the use of an electronic medium, for example in widening access, increasing speeds of transmission, and in building a genuine two-way communication with citizens. But Alastair suggested that the government has been less ready to consider the associated information issues that arise. Some of these issues are technical e.g. effective long-term archiving and some are social e.g. the digital divide and technological illiteracy.

The way that e-government is conducted varies but can include Freeview, e.g. BBC Parliament, Teachers' TV, digital televising of council meetings and SMS messaging as well as web publication. The advantage of these methods is that the public have direct access to all the information and can weigh up the debate for themselves as opposed to receiving it through a media filter.

Users need to be able to find information easily and to know which department produced it and when. The user also needs to know how to track the document again, should it be required for future reference, so long-term archiving, bibliographic and version control are as important as good web design. But good web-site design for e-Government is another area in which Britain lags behind Canada and America. Alastair suggested that the key features of a good government web-site must be accessibility and ease of navigation whether this is for the first-time user or the information specialist. Regrettably there is a dwindling number of government information specialists (they currently number only 28) and universities too no longer seem to be appointing or maintaining such specialists.

Alistair looked at the major obstacles to retrieval that face the user of government information. The first is related to the diversity of publishers and access to the published documents. The user cannot get hold of the information if he does not know who the publisher is. There have been fundamental changes in the way in which government information is published, so the search strategies and processes are as varied as the enquiries. This is manageable for the specialist dealing on a day-to-day basis with official publications but it presents a real challenge for the reference librarian who may only deal with such enquiries on an infrequent and irregular basis.

There used to be a single government publisher (HMSO) which operated with selected High Street outlets but now citizen and information specialists have to deal with an increasing number of conventional publishers, as well as the growing trend in desktop publishing Every single government agency can also act as a publisher, producing information in diverse data formats. There are for example, approximately 300,000 Home Office PDF pages. In this instance, unless a user knows exactly what he is looking for, or is lucky enough to have a web citation, his search is likely to be lengthy. It all makes for a compelling argument for a commitment, by all departments, to well-linked and structured archives. It also makes the case, as Alastair pointed out, that more information specialists are needed, not fewer. Our traditional taxonomic skills apply as much to web page and database construction as to catalogues. Alastair's suggested formula: "1 good information manager=2 brilliant graphic designers".

Once the user has identified the document required, the next hurdle is access. As we saw earlier, High Street availability has gone with the demise of HMSO and there is no longer any single central sales point. To make matters worse there is still no complete national bibliography and there are some gaps in the national library archive. To compensate, it is worth trying databases like COPAC, which contains 27 million subject links and is one of the best resources for government information.

There are portals designed to direct the user to the correct source, e.g. Directgov but in Aliastair's view British e-government documentation and provision compares unfavourably with that in the USA, where over 1081 federal publishers operate. A major difference is that the major American universities have very good collections which include Britain, and Alastair explained that it is often helpful to go through an American portal for hard-to-find British official publications. GODORT hosted by Northwestern University Library and the University of Michigan Library are particularly recommended. Alistair recommended the use of Google Scholar, or indeed any of the 'Advanced' features provided by the search engine to limit searches to the gov.uk domain name. The second obstacle in attempting to access government sites is that it is difficult for users to formulate questions correctly, and they may not be able to confirm that the information found is what they require.

There is a good case for saying that we all need information skills training and more investment is needed in librarians' training for the future. But we also need to ensure a wide distribution of leaflets that list the key local and national government web sites. Bearing in mind that some users have no keyboard skills or familiarity with web resources there is still a need for conventional publications to give, at the least, background support information.

Information provision in academic institutions

Jane Mortimer's paper looked at the needs of one very specific group of users, distance learners at De Montfort University, and considered the ways in which e-resources, combined with physical access, can support their study. The increasing range of electronic resources available combined with greater collaboration between HE institutions (e.g. UK Libraries Plus scheme) have helped equalise the gap between the full-time campus student and the distance learner.

All distance learners at De Montfort fall into a clearly designated group to ensure that they have full access to course resources: for example, there are some services which are offered exclusively to distance learners like the postal loans and photocopying services. There are numerous reasons why students may opt for distance learning but they include work, study, family commitments, geographical distance as well as personal choice. There are 16,222 full-time students (68%) but a sizeable proportion (24.2%) of the student population is part-time, with distance learners accounting for 7.5%.

Some courses at De Montfort are only offered as distance learning courses and there are also part-time courses which suit students on placement, involved in research or working part-time. These are typically offered through partnerships with franchise colleges (there are 1810 partner colleges).

De Montfort has taken steps to widen physical access to the collections by providing 24/7 access to the building, inter-library loans, links to external resources, and an online catalogue, thus ensuring that all students, both campus-based and part-time and distance, have maximum opportunities. Online reading lists link up with the catalogue.

Electronic resources at De Montfort comprise electronic journals, subject databases, e-books and reference materials, e.g. dictionaries, directories and standards. Electronic journals are accessed via OPAC and database links and from A-Z listings but, as with their print counterparts, their provision gives rise to significant cost issues. Subject databases cover both subject-specific and multi-disciplinary resources and can be full-text or selected abstracts. Jane spoke about the issues of equity and equivalence, using the provision of subject resources for business and management by way of example. These provide comprehensive full-text services, but students of art and design are less well served. The e-books collection is small, consisting primarily of background material rather than primary texts, and does not appear to be heavily used by students. Jane referred to a current research project at UCL to examine use of e-books by academic staff and students. It is a project that will be watched with interest by HE institutions which are waiting to take decisions on investment in remote resources. De Montfort also provides a number of other information resources, e.g. Web portals, digitised collections of examination papers, electronic course packs and so on. Students are also offered an off-campus library account, and have remote access to files (files created first on campus).

Offering such a wide range of services, and ensuring that they are correctly targeted to the part-time and the distance learner presents considerable technical and other challenges. Athens authentication provides the standard entrance route but students have different logins for e-mail and VLE, and a separate login and PIN for library use. User education and support is seen as a priority. Face-to-face inductions are given where possible but are backed up by CD, video, web-based tutorials, and online library guides and publications. Other services offered include telephone and e-mail support for users and virtual reference and enquiry facilities are provided via a web-based enquiry form. Jane concluded her paper by reminding us all that library services need to be marketed, hence De Montfort's @access anywhere campaign: at the very minimum, students need to know that the library web page is their gateway and that for all students, not just distance learners, there is much more on offer. However, ensuring that distance learners in particular take full advantage of the support offered presents a real challenge for the future.

All four speakers took joint questions at the end of the afternoon and there followed some stimulating debate on the whole range of topics covered. If you missed this East of England event make sure you register for the next one! It is a good way of keeping up to speed with fast changing and complex areas, and of networking with fellow practitioners.

 

You can download Phil Purdy's presentation on the Peoples' Network by clicking on the symbol above



CILIP (East of England) Branch
Charity No. 313014

Powered by Pure Energy