IBVRE Digital Paper Evaluation Report
Transcription
IBVRE Digital Paper Evaluation Report
Integrative Biology VRE Work Package 7: Digital Paper Evaluation Report Matthew Mascord Marina Jirotka Annamaria Carusi Intended Audience The purpose of this report is to give an account of the evaluation of Anoto digital pen and paper technology conducted as part of the Integrative Biology VRE project at the University of Oxford. The audience for this document includes the digital pen and paper developers and manufacturers, as well as those researching paper augmentation or replacement technologies. It is also likely to be of interest to the wider VRE development community, especially those projects funded as part of the JISC VRE2 programme. Issued: 17 April 2007 1 Executive Summary The JISC funded Integrative Biology VRE project aimed to investigate and develop the Virtual Research Environment concept in a computational biology setting by extending and complementing the work of the EPSRC Integrative Biology (IB) project in the areas of user interfaces and collaborative tools. IB is building e-Infrastructure to address the needs of an interdisciplinary and globally distributed community of heart and cancer modellers researching the causes of colorectal cancer and cardiovascular disease. Following an analysis of research processes across the IB user community we chose to evaluate Anoto Functionality, a leading digital pen and paper technology, to address the community's pressing need to better manage their paper-born material, generated through a range of key research practices such as mathematical modelling, data taking, and the keeping of laboratory notebooks. Twelve Logitech IO2 digital pens were distributed to heart and cancer modellers in the IB community to use for three months. Their experiences in using the technology were reported back through a miniworkshop on digital paper, one-to-one interviews, and through an online survey. The following gives a summary of the key benefits, issues and requirements that emerged. Conclusions and Recommendations 1. Anoto Functionality can bring significant benefits to the researcher who uses paper as a fundamental component of their research practice, for example, by providing a backup, facilitating sharing, making material easier to locate, and offering opportunities for capturing information digitally in situations where a tablet or PDA is unavailable or cumbersome. 2. Although the sample community in this study expressed little desire for a structured online repository for digital notes, experiences reported by other projects, such as the evaluation at CERN, have shown this concept to be highly relevant in situations where there are already well established and justified practices of regularly sharing paper-based material amongst collaborators. 3. The significant interest in the possibility of converting handwritten equations to LaTeX (a mathematical typesetting language) appears to merit further research in this area, building on work such as PenMath at the Ontario Research Centre for Computer Algebra. 4. If handwriting recognition is to be offered to the scientific community as a key value proposition in future products, further work is required to make it more reliable for scientific text. Support for Greek and common short forms needs to be supported by default, as well as simple table and column recognition. There also needs to be a straightforward way to tag sections to be excluded from this processing. 5. Special tags to associated pages together, a document annotation solution, a Mac/Linux version, and integrated search were the requirements deemed most important by evaluators and as such should be seriously considered by those developing tailored solutions for this community in future. 6. In the scientific community there are well established practices of building and exploiting home-grown software to meet highly particular needs not catered for by commercial suppliers. We would strongly encourage Anoto and its partners to publish their file formats so that those in the scientific community can build on the technology, making it more useful and facilitate a wider take-up. 7. To encourage a wider adoption, Anoto technology suppliers should consider allowing end-users to print out their own paper, and to implement greater interoperability between paper formats produced by different suppliers. Allowing end users to print out their own paper on scrap could alleviate some of the environmental/cost concerns associated with the use of the special paper. 2 Introduction The Integrative Biology VRE project was a JISC funded Virtual Research Environment project based at the University of Oxford that ran for two years between April 2005 and March 2007. Led by Professor David Gavaghan, it aimed to investigate and develop the VRE concept in a computational biology setting by extending and complementing the work of the EPSRC Integrative Biology project. The Integrative Biology (IB) project is a second-round EPSRC e-Science Pilot project which commenced in February 2004 and is funded until January 2008. It received initial funding of £2.4M to support the research of nine post-doctoral researchers, and to provide training for 10 PhD students, across seven UK Institutions (the Universities of Oxford, Nottingham, Leeds, UCL, Birmingham and Sheffield, and CCLRC), IBM, and the University of Auckland. The IB project's scope is a combination of science and technology: on the science side, it aims to further understanding of the causes of cancer and cardiovascular disease through a combination of mathematical and computational modelling; on the technology side, it is building the underlying Grid infrastructure required to support this research. This infrastructure follows a service-oriented Grid architecture, and is utilising many of the middleware developments within the UK eScience Programme and in the wider Grid community, and within the JISC Middleware programmes. Whereas the IB project has focused on constructing a set of generic middleware services that can be composed to support a diverse range of activities, the Integrative Biology VRE project was set up specifically to investigate and address the user community's requirements in the areas of graphical user interfaces and collaborative tools. In an initial requirements gathering exercise carried out at the start of the project [3], one of the highest priority requirements to emerge was that of providing better management of paper-based material produced through a range of research practices within the IB community. This report describes the findings of an evaluation of a bespoke technology designed to address this - Anoto Functionality - a digital pen and paper solution based on the use of real pen and paper. 3 The Integrative Biology Community The two main user groups within the IB community are the heart and cancer modellers. The heart modellers have a long history dating back some forty years to the pioneering work of Denis Noble (1962). In contrast, cancer modelling, particularly the modelling of colorectal cancer currently being tackled by the IB cancer modelling Tumour Task Force (TTF) is relatively new. Heart modelling groups within the community tend to be globally distributed but tied together through the common use of a large-scale simulation package, used on a day-to-day basis to perform in silico experiments. There are several examples of these in use by the IB community. However, the most commonly used are: Memfem [4], used by groups at the Johns Hopkins University, University of Oxford, and Tulane University; and CMISS [5], written by Peter Hunter and researchers at Auckland University, New Zealand. In contrast, the cancer modelling groups are largely UK-based, at Nottingham, Birmingham, and Oxford Universities. The aim of the IB project is to support this community in its initial attempts to build a comprehensive model of cancer development across multiple spatial and temporal scales. However, at present, the majority of their work is mathematical, and fundamentally paper-based, with some use of symbolic manipulation packages such as Mathematica or Maple to provide quick answers. Individual researchers do perform small-scale simulation work in packages such as MatLab, but this work is mainly to validate and provide proof of concept for the fledgling mathematical models. 4 The Use of Paper within the Community Within our initial analysis of research processes across the IB community, we found that paper was used by researchers in two quite distinct contexts: either as the principal medium through which research is conducted, or to provide a record or diary of work that has been, or is in the process of being carried out. Paper was also often used to help an individual manage research tasks on a moment-to-moment basis, for example, in the form of 'To do' lists, or Post-it® notes stuck to a computer monitor. When paper is the primary tool used for carrying out the research, as is the case with the mathematical modelling of cancer tumors, there did not appear to be a need to keep a separate record. Typically, these modellers produce mathematical formulae on scraps of paper or whatever is at hand. These scraps may then be collected some time later and ‘filed’ in boxes or drawers. Some order may be imposed by stapling together chunks of related of material before filing, for example, a combination of notes, plots, figure cutouts, and calculations. Further uses of paper, relevant to both the cancer and heart modellers included the drafting of academic papers, recording thoughts, note-taking at seminars and making notes from papers. Whether or not material is kept in the long run is an archival decision taken by the individual researcher; for some materials the decision is easy, for example, previous drafts of papers are not usually kept, as anything worth keeping would be typed up. However, while this preliminary or incomplete material may not be useful for a current project, it can often turn out to be very useful in the future, when it can be further expanded upon. Paper tends to be used as a record-keeping medium where the work has been conducted outside the medium of paper, for example, performing experiments in a lab, or the computer-centric design of in silico experiments. The usual form for this record-keeping activity is the traditional lab book. These books have been identified as forming a critical part of the modellers’ research practice. One heart modeller whose research examines the electrical activity of the heart through computer simulations, said ,'it would be a disaster to lose my book'. In these settings, entries represent the evolution of scientific discovery and as such the experimenter notes the procedures followed, methods used, results and any ideas or observations that occur during the work within the lab or outside of it. It is used to record thoughts and input parameters to experiments; build up and record reading lists, stick in figure cut-outs from papers and results. Selected pages from the book were sometimes photocopied and shared with others if people needed them. It is important to note that the lab books are written for specific groups of audiences, namely other scientific researchers, so that others may follow the steps and approaches taken. Consequently, specific practices have evolved, for example, that entries are chronological not logical. This means that if a solution to a past problem emerges it should not be added to the pages dealing with that problem. Rather it should be listed chronologically, on the day it occurs, with annotations to show to what it refers. In addition, they may also be used as a source of evidence to establish intellectual property [6] [8] [15]. Other researchers use online blogs or wikis, as an alternative, or in addition to, their paper-based lab book. The reasons for using paper rather than an online equivalent are sometimes simply because the researcher prefers it, but also fundamentally due to its convenience - often it may be impractical to use anything else. For example, heart modellers when performing experiments on heart patients, use lab books to record data during the experiments. Though they may use PDAs on a regular basis, paper is preferred in these situations as it is seen as more flexible and reliable in the narrow time windows available for these experiments - which are conducted in the middle of heart surgery. In this case, the lab book is used as an interim storage solution; the data is always subsequently scanned in for analysis by colleagues. These findings resonate with research in CSCW and workplace studies on the affordances of paper [7]. This research has suggested that often technological devices such as PDAs, as in this example, become the ‘interactive domain of interest’ instead of the activity the practitioner is engaged in such as the heart surgery in this example. Entering data onto a PDA or other technological device requires the practitioner to examine the screen to look at what is being typed in. Writing with pen on paper is more immediate and does not require the extra step of checking. 5 Anoto Functionality There have been several technological approaches to addressing the problem of capturing handwritten text digitally. Such solutions rely either on the use of special paper, special pens or some combination of the two. Prior work, for example SmartTea [16], has addressed this through the development of bespoke digitising tablets. However, we sought a solution that would be as close as possible to the look and feel of writing on real paper, and so chose to evaluate Anoto functionality [9], the market leader in digital pen and paper solutions based on the use of real paper. Anoto technology is based on a special pen, and paper imprinted with a barely visible dot pattern. The pen uses ink, and writes just like a normal ballpoint pen; equally, the paper is real. However, the pen is also equipped internally with a digital camera, image processing unit, and communication device. When the user writes on the specially imprinted paper, the digital camera, by recognising the dot pattern, captures the movements of the pen across the paper, as well as the pressure and angle of the pen on the paper. Data stored in the pen can be transferred to a computer for view, storage, or manipulation through a docking mechanism, or for those pens that support it, bluetooth. Bluetooth functionality also allows data to be sent to others while on the go. For example, the Nokia Digital Pen SU-1B [10] allows handwritten notes to be sent to others with compatible devices via a mobile phone's multimedia messaging service (MMS). A wide range of commonly used paper products have been manufactured to include the Anoto dot pattern. For example, Black 'n' Red notebooks, A5 spiral bound notebooks, A4 pads, and even Post-it® notes. Such products often include printed checkboxes that control the operation of the pen itself. For example, all notebooks have a 'New Notebook' checkbox, which must be checked when starting a new book to inform the software that all new pen data should be stored in a new file. Another example is the use of gatefold pages containing checkboxes to control the colour and thickness of the writing as it appears in the digital pen software. The types of digital pen products adopting Anoto functionality range from those sold with consultancy by business-to-business system integrators constructing bespoke solutions intended to be tightly integrated into a particular organisation's business processes, to products sold by consumer facing firms providing more plug-and-play, multi-purpose solutions. As we did not want to make any assumptions about the types of activities the technology would be used for, the main criteria for technology selection were that the technology should be multi-purpose, support a wide variety of paper types, yet be very easy to set up. This rationale led to the decision to use the Logitech IO2 Digital Writing system [11]. 6 Evaluation Methodology The overall intention of the evaluation was two-fold. It was first to address a range of needs at a very practical level, but it was also to identify requirements on how this technology might be developed further, particularly with respect to its integration within VREs. The methodology chosen aimed to examine the use of the technology in as naturalistic setting as possible. Rather than prescribing set tasks, we wanted to allow for the use of the technology for more freeflowing activities. By embedding the pens in the researcher's context, they acted as a probe [12] or intervention in the researcher's environment, allowing us to get at this context. Hence, we adopted a diary study approach. In diary studies [13], participants using a technology are asked to record observations of their use of a technology in a diary throughout the day. In this case, however, the very technology the researchers were trialling was used to record observations. Participants were asked to keep an informal log of their experiences of using the pen and paper at the back of the supplied Anoto notebooks. In April 2006, we conducted a two week preliminary study, involving two members of the Mathematical Institute at Oxford. The purpose of this was both to inform our main evaluation but also to contribute to a poster and demo at the UK e-Science All Hands Meeting 2006. In the main evaluation, carried out between July and September 2006, twelve members of the IB community were involved, mostly heart and cancer modellers at various academic levels, from graduate student to professor. Experiences of participants involved were elicited at a mini-workshop on digital paper, follow-up interviews, and through an online survey. 7 Preliminary Study At the start of the preliminary study in April 2006 we had not yet ordered the Logitech IO2 Pens but already had to hand a pen supplied by Maxell, and so we decided to use it for this initial investigation. The DP-201 Bluetooth/USB pen, packaged inside the Maxell Digital Pen Evaluation Kit, also shipped with a cradle for docking, a spiral-bound A5 notebook, and the Digital Note Viewer software. Docking the DP-201 pen caused the pen data to be uploaded to the Digital Note Viewer software, and deleted from the pen. In common with other products adopting Anoto functionality, the data was saved as a set of vectors within a Pen Generated Document (PGD) file, recording the movements of the pen across the paper. This data storage strategy meant that file sizes are typically much smaller than an equivalent image of a scanned page, even when the image is compressed. The Digital Note Viewer software presented the pen data as a virtual notebook, a series of pages mapping directly to the physical book, and allowed the data to be viewed, manipulated, and converted to a variety of image formats for transfer to standard office applications. Pen strokes could be viewed as-is or 'replayed' within the software, exactly as they were created (see figure 1). Figure 1 - Digital Note Viewer In this preliminary study, the pen was used for a wide range of activities, for example, drafting sections of a paper to be submitted, checking calculations, note-taking in a library, and small amounts of cancer modelling. It was also used in supervisions with students and post doctoral researchers. Design of the Pen The researchers found the pen reasonably easy to use while slightly fatter than pens they would normally use. One researcher commented that her handwriting was slightly worse when using the pen but that this was not a major issue. There were no problems with the life of the battery but, apart from the example of its use in a library, pens tended to be used in the office where they would be docked (and hence charging) when not in use. Design of the Software The software provided as part of the evaluation kit, the Digital Note Viewer, was seen as basic but easy to use. The PGD file could either be kept untouched or manipulated as required: this was seen as both an advantage and a disadvantage. One modeller liked the fact that pen data could be manipulated as it allowed her to simulate, in the digital note viewer, the protocols she had developed with physical paper of gathering together, and stapling related material. However, it was seen as crucially important to always keep a readonly master copy, recording everything that the pen writes sacrosanct, so that it is always possible to go back to the original in the event of hardware, software, or human error. It was suggested that the software might do this automatically, offering only copies for manipulation. Another safeguard suggested was that the software should automatically save the pen data to the PGD file upon docking, rather than the responsibility for this lying with the user. The software as it stood left open a small risk of data loss for users not properly educated on the importance of saving the PGD file immediately upon docking: this was encountered after losing a small amount of material this way. Small file size was seen as a major benefit, although this was only realised when all manipulation of pen data was performed within the Digital Note Viewer; when exported to other packages, the size of the pen data increased considerably. Further features requested were for some support for highlighting material in the viewer, and being able to paste in figures/cut-outs from papers, or plots from mathematics packages, ideally from within the Digital Note Viewer software to retain the advantages of the small file size. Paper Formats There were two aspects of the paper format that were considered: the size, and the binding. For taking notes from seminars and papers, the A5 size was seen as ideal, and the spiral binding useful, to ensure pages do not go missing. However, for mathematical modelling, the A5 format was seen as too small, with the researchers finding they were running out of space. Additionally, the binding was seen as an impediment, as some researchers like to arrange pages in front of them whilst working to be able to see everything at a glance. A4 pads appeared to be the ideal because they are large enough for the mathematics and also allow pages to be ripped out. Whether or not the paper is lined, plain or squared was regarded as unimportant. According to the help documentation provided with the digital pen, it was not possible to use multiple books of the same type at the same time. Using multiple books of the same type with the same pen would result in an irreversible 'double exposure' effect, with pen data from multiple books overlaid within the same page in the Digital Note Viewer software. One way round this, was to always tick the 'New Notebook' tickbox, before switching to a different notebook - forcing the software to prompt for a new PGD file upon docking, removing the possibility of the double exposure effect, but at significant user overhead. This was considered to be a real limitation of the technology as it is common practice to keep different notebooks for different projects, and separate books for seminar notes, notes from papers, as well as for work with students. Having only one A4 pad on the go at once would be less of a problem, as pages within these are not normally kept together as one unit, but are removed and stapled with related material. Organising Material for Efficient Retrieval It was thought that the pen would assist in the management of large amounts of hand-written material, for example, by keeping track of the volume, being able to categorise it, and keeping it safe. 'Safe' was defined here in terms of being able to find it again, rather than protecting it from physical destruction. Researchers may encounter difficulties in organising the paper material, and can spend a large amount of time looking for material but not finding it. Some find electronic documents easier to organise than paper, mainly because material can be named and organised into directories. For this reason, some type up a lot of material simply in order to be able to find it again. It was commented however, that a longer study would be required in order properly to assess these benefits, to allow a larger volume of material to be accumulated. It was thought that Optical Character Recognition (OCR), if good, would be useful, but problematic as mathematical calculations do not tend to consist of long strings of words but are more likely to consist of isolated words interspersed with equations and diagrams. Its main use might be to enable the search for seminar and paper notes, rather than to convert to text. There was however, interest in any tool that could convert hand-written mathematics directly into LaTeX [14], a typesetting language for mathematics, as this would save everyone a lot of time. Using Digital Paper for Email There was also strong interest in the idea of using the technology in conjunction with email, particularly when collaborating with non-local collaborators. Difficulties are encountered using email for mathematics because it is a predominantly text-based medium, whereas in order to communicate mathematical ideas effectively, an equation or diagram is nearly always required. It was reported that some groups of mathematicians try to get round this limitation by writing commands from LaTeX directly in the email, for example '\int' for the integral sign. A far better solution, however, would be support for sending handwritten equations or pictures directly in email, a feat trivially accomplished with digital pen technology. When probed as to why mathematicians did not use scanners or faxes to achieve this same objective, researchers maintained that this would entail a lot more work due to the extra step of finding and using the scanner or fax machine which are rarely at hand. However, there were limitations as the technology worked only on Windows, and some researchers used a text-only email client, Pine, on a Linux desktop machine. The problem here would be that the pen-generated image would need to be transferred from the laptop to the Linux desktop over FTP before it could be sent in an email. This extra step might well be a significant disincentive. However, having the capability to send pen data over email might be an incentive to switch to a graphical email interface. Another related possibility that wasn't explored as part of this evaluation, was the use of digital paper in conjunction with instant messaging (IM) applications. Using Digital Pens for Training Some researchers tended to do less work individually, and saw the main use of the technology as aid to supervisions of students or post doctoral researchers. During these meetings, students bring work they have completed and student and researcher work together on a problem. Usually the student will take the resulting material away at the end of the meeting and the researcher does not see it again until the next time they meet, assuming that students are organised enough to bring it. If the digital pen were used in these meetings, there would be a digital record of everything that was written, even if the paper record was taken away by the student, and so it would be easier for the researcher to keep track of exactly what was discussed. One limitation of the digital pen technology, however, is that it is not possible to use multiple pens with the same pad, and so the pen would have to be shared by both the supervisor and supervisee. This was not seen to be a problem as the researcher would normally be doing most of the writing. The only place where the technology would not work would be when writing over manuscripts. 8 Findings from the Main Evaluation The mini-workshop brought together those who had evaluated the technology as part of the Integrative Biology VRE project, as well as a number of those who had evaluated it, or were planning to evaluate it, in related outside projects. The aim here was to allow these people to share their experiences and help define priorities for future development, ascertaining any need for the tailoring of this technology specifically for the IB community, or integration of the technology within a VRE. Those invited from outside the project included Paul Luff, co-ordinator of the EU PaperWorks project at Kings College London; François Grey, head of the IT Communications Team at CERN; and Jeremy Leighton-John, Curator of Digital Manuscripts and Scientific Curator at the British Library. Both Paul and François had evaluated the technology in previous projects and were interested in the experiences of those working within an integrative biology context; Jeremy was interested in how digital paper might change the practices of note taking in the field. Ruth Kirkham, Project Manager of the Building a VRE for the Humanities (BVREH) project, also attended in the hope that the workshop might help inform a planned evaluation of Anoto Functionality with humanities researchers. Workshop Agenda The agenda was designed to allow everyone who had evaluated the technology at least ten minutes to describe their experiences, before participating in a session dedicated to prioritising any emerging issues or requirements. Paul Luff helped frame the discussion initially by introducing a number of projects and possibilities relating to paper augmentation. There was also a final session, aimed more at those researching the technology, to address the wider question of paper augmentation and replacement. Round-the-table Summaries In this session, eight individual experiences were presented. A good range of usage situations were described, from basic note taking in meetings to the more advanced examples of using the digital pen and paper to record data during experiments. The most common use was using the pens to take notes in meetings and at conferences. One of the more innovative examples was using the pen to prepare material for presentations. Figure 2 - A Mathematician using Digital Paper for Cancer Modelling Evaluators' attitudes to the technology following this evaluation were wide ranging, from the fairly negative to the highly positive. On the negative side, there were examples of users experiencing serious problems with the software conflicting with other applications running on their computers. In one instance this led to an evaluator giving up after only one week. On the more positive side, many participants were very enthusiastic and fully intended to continue to use the technology even after the end of the formal evaluation period. Key Benefits The aspect of the technology people liked the most was having a digital copy backup of the hardcopy and vice versa. One mathematical modeller used the pen for keeping notes at conferences and mathematical modelling, and found it useful that even if the hardcopy was thrown away or lost, the digital copy would remain. On the other hand, another participant described how she had had problems with her PDA that had led to data being lost – an advantage with the digital paper is that even if the digital copy is erased, the hardcopy remains. The other major benefit reported was how much easier sharing material became. Examples were cited of mathematicians sometimes scanning or digitally photographing material and sending it. The main benefit of the digital pen is that it does not rely on having a scanner or digital camera to hand - if the pen is being used on a day-to-day basis, material can be sent instantly over email, as-is. The fact that digital notes are stored in a vector graphics-like file format, also makes such emails more space efficient than an email with an equivalent JPEG or TIFF attachment. Other positive things evaluators said about the technology included: • • • • • • • • It was useful when the tablet/PDA was unavailable or cumbersome. It was good for writing letters and emails, and simple diagrams. It was good for writing equations even though it doesn't convert them. The saved output is very close to the written page. A presentation prepared using the pen looked nice on screen. It had great battery life. The memory was ample. The pen size easy to get used to. Quality of Handwriting Recognition The most universal complaint was the quality of the handwriting recognition system, a feature absent from the digital pen software used in the preliminary study. The handwriting recognition appeared to be reliable only for plain handwritten text, without any scientific short forms, equations, diagrams, dates, or Greek characters. Even for plain text, participants had mixed results, with reliability appearing to depend on the person, and their handwriting style. One of the evaluators thought he had bad handwriting and was surprised that the pen software managed to recognise it. On the other hand, another found that the system would not recognise her handwriting even after running the training application four times. An evaluator based at the Oxford Physiology department reported that the plain text conversion tended to perform worse with 'decorated' pages, for example, where the text contains doodles or diagrams, or where certain words are circled. When questioned further about their need for handwriting recognition, it often appeared that this derived from the requirement to be able to search the digital notes - a feature absent from the Logitech IO2 software. Not everyone in the group was fully aware of the Logitech Google Desktop plugin that has the capability to transparently index and OCR digital notes residing on a computer in the background. Related to handwriting recognition was the requirement for the recognition of simple tables. An experimentalist based in the Physiology department described how she frequently writes down data in tabular columns during experiments in the lab, and found that the Logitech IO2 software was unable to recognise them. She had also tried an evaluation version of another package, MyScript, which provided basic table support but still lacked recognition of scientific short forms. Other Issues The most serious issue with the Logitech pen encountered was related to an intermittent fault with the pen ink sensor. Four of the participants experienced this fault to greater or lesser degrees. The sensor appeared to stick, so that even when the pen was lifted off the page to start a new mark, it would continue to register the underlying pattern, resulting in a scrawling effect Figure 3 - Example of Scrawling Problem - the intended text here is 'Scrawl test'. A solution to the issue was eventually identified, though too late to be of use to those involved in the evaluation. Applying a small amount of lubricant, in this case WD40, to the pen ink refill seemed to free the mechanism allowing the pen to register correctly when the pen was in contact with the page. Note that there were no instances of this issue experienced with the Maxell DP-201 pen. A rare but serious issue affected the performance of the software itself. In two cases there was a conflict with other software running on the evaluators' machines resulting in 100% CPU utilisation. A third participant didn't encounter this problem but would have preferred it if the software were not memory resident and for it to be possible to manually upload notes from the pen rather than this happening automatically every time the pen is docked In terms of more fundamental issues with the technology, one concern expressed was related to the cost and environmental consequences of the use of the special paper. One of the mathematical modellers described how many mathematicians write on any paper they have to hand, mostly scrap, and even napkins. Unless useful, much of this material is thrown away. It was felt that using such high quality paper at all times would be both costly - as the paper is more expensive than premium standard paper - and wasteful. The other related issue here is the fact that whilst you might have the pen to hand, you may not have a convenient supply of Anoto paper. Suggestions The mathematicians liked to spend time organising notes after upload, for example, by distilling conference notes to make them more useful. One mathematician when preparing a presentation using the technology liked the way text could be moved around, and was particularly pleased with how the end-result looked on screen. On the other hand, many did not like the way the software stored each notebook page as a separate file they would have preferred it if file organisation could be controlled from the paper itself through the use of special tags, to enable, for example, each talk to be stored as a separate file, rather than every page. Other suggestions included the following: • Ability to write over the top of presentations while delivering them, for example, as an alternative to OHPs. • Ability to record audio notes on the pen where writing things down is too time consuming • Support for Linux and Mac - all evaluators had access to Windows but many used a Linux machine as their main workstation. • A wider choice of ink colour, beyond blue and black, to allow for example the use of different colours in diagrams: one for the equations, and another for the boundary conditions. Experiences at CERN François gave a general overview of science carried out at CERN, noting some of the larger particle physics experiments and their need for a distributed computing infrastructure due to the vast amounts of experimental data generated. He also introduced work by summer school students on the OpenLogBook project [17], to develop an electronic lab book solution for a number of scientific groups at CERN. The bulk of his presentation concerned work to extend OpenLogBook with support for digital pen and paper, and the evaluation of this technology with three research groups at CERN. The biggest breakthrough for the technology was with the Athena antimatter experiment [18], a collaboration of around 100 people and 20 institutes. Athena use pen and paper log books as a central part of their data taking; in the course of a year, 30 paper log books are filled in by control room operators, recording key qualitative data relating to the experiment. At the end of the year, a secretary takes the stack of thirty books, copies them, and distributes to the 20 partners. Scientists in each institute are then able to see all data captured throughout the year. In 2004, Anoto digital pen and paper technology was introduced into the Athena collaboration as a replacement for their paper log books [19]. In addition, a simple password protected web site was developed to allow digital notes to be deposited and shared amongst partners. The introduction of this technology meant that the log book information could be shared immediately with remote partners, increasing the interactivity scientists had with the people in the control room. The fact that the digital notes were time-stamped also introduces the possibility of making connections with other related digital data. In 2006, a more general evaluation of the technology was conducted at CERN. 25 Logitech IO2 pens, provided free by Logitech were given out to test users. The outcomes from this were as follows: Benefits: • Sharing paper on web very useful (as mentioned) • Password protected. • Portability. • The ability to quickly compose and send hand-drawn sketches. Weaknesses: • No ability to associate printouts, and graphics into the digital notes. • No annotation solution. • No Linux/Mac solution. • No solution for annotating the online version of the notes, only the original paper copy. François also mentioned the fact that they would like to get the stroke timestamp from the Anoto XML file, which is currently encrypted. This led to François later indicating the desirability, from his perspective, of Anoto publishing its file formats as open standards, to allow such information to be interpreted and used more widely. Outcomes from Open Discussion This open discussion, facilitated by Marina Jirotka, was designed specifically to prioritise the requirements and issues already established. As only three of the original evaluators could stay for this session the outcomes were not, unfortunately, representative. The session turned out to be more concerned with elaborating previously mentioned issues/requirements, as well as uncovering further requirements that had not already been raised through the round the table summaries. According to the evaluators remaining, the most important issue to address was the scrawling. One of those experiencing this felt that if she continued to get it all the time, she would stop using the pen. Next on the list was addressing the post-processing issues. It was described how the Logitech IO2 software's method of saving each individual hardcopy page as an individual file led to the need for a lot of post-processing work to associate related material together as one file. There was the suggestion of introducing unobtrusive formatting marks to control the treatment of notes after they are uploaded into the software. By drawing lines in the margin, the user could indicate which sections of each page to go into one file, and the name of the file. For example, the latter half of page 2, page 3 and the first half of page 4 should go into a file called X. The group also felt it might be useful to have tags to indicate sections to completely exclude from upload. Another related issue was the ability to associate print-outs or diagrams with the digital notes. In the hardcopy, printouts and diagrams would be stapled to related mathematical material. In the Logitech software there was no ability to associated non-digital note material in the editor. Although none of the participants expected the technology to recognise mathematical equations, the possibility of translating these was briefly discussed. Though most doubted the feasibility of accomplishing this, those present thought that if it was possible, LaTeX would be the most useful format to convert to. It was commented that other experimentalists may want conversion to other mark-up languages such as ChemML. Prior work to convert handwriting to MathML [20] was mentioned, and there was the suggestion that this work could be extended to work with the digital pens, by integrating existing open source tools to convert MathML to LaTeX. Some small but important usability issues with the pen were raised. Two participants found that the pen would buzz and beep in certain situations. One thought it was related to the scrawling problem; another thought it occurred when writing too close to the centre divide of the Anoto notebook. It was reported that this buzzing and beeping would seriously hinder their concentration, preventing them from using the technology. Another issue reported was the way in which the Logitech IO2 pen would automatically turn itself off after a period of time, if the cap was left off. Two reported this problem and found that it led to note loss because it was not always obvious that the pen was off. In terms of integrating digital notes into a VRE, there was some discussion about the need for providing some form of online repository for digital note data, such as the one developed by François Grey's group at CERN. One participant who was not involved in the IBVRE evaluation, described how this would be a very good idea from his perspective because his group frequently share data between collaborators. Notes are taken, scanned in, and PDFs sent across - a shared web page that everyone can log into would be a really good idea. The question of tagging notes to be viewable by others was raised, but none of the participants expressed an immediate need for this. Although only mentioned near the end of the discussion, the ability to print out documents on the digital paper and annotate was raised as a key feature missing from the Anoto digital pen and paper technology. One of the mathematical modellers described how he would often receive a draft via email but could not send handwritten comments back, which results in him posting or writing the changes into an email. Drafts get bounced about a lot. If the pen can only be used for the first iteration, it is not so useful. The group felt sure there would be people working on this and mentioned work with publishers to carry out their editing through the use of digital paper. The standard set of proof editing marks could be recognised by the digital paper software. Near to the end of the discussion, the question of the possible personal identification of digital notebooks was raised. It was noted that the Logitech books have bar codes, and that other people had been experimenting with RFID tags. One of the participants felt it might be possible to integrate an identifier into the dot pattern itself. It was pointed out that the Logitech pen can hold the user's name and email address, and the potential for privacy concerns. No one among the remaining evaluators expressed a particular need for this. Online Survey An online survey was developed following the workshop to help establish a prioritised list of issues and requirements. This was a short structured survey built using the free SurveyMonkey [21] web-based tool. It was based on the list of usage situations, issues, and requirements gathered through the workshop, and follow-up investigations. Ten out of the twelve evaluators responded. Figure 4 - Digital Pen Survey Respondents were asked to rate each issue and requirement in order of importance, from 'Very Important' to 'Not Important'. In order to translate this importance matrix into an ordered list, a simplistic scoring algorithm based on weighted sums was introduced. This assigned 'Very Important' a weight of three, 'Somewhat Important' a weight of two, and 'Important' a weight of one. An issue considered by all respondents to be 'Very Important' would therefore score thirty whereas an issue considered 'Not Important' by all respondents would score zero. Score = 3 x <Number of Respondents Rating as Very Important> + 2 x <No. Respondents Rating as Important> 1 x <No. Respondents Rating as Somewhat Important> The following list of usage situations validates the findings from the workshop that the most common use was for taking notes in meetings and conferences. It is also useful to bear in mind while considering the issues and requirements users felt most important. Rank 1 2 3 4 5 6 7 8 9 10 11 12 Uses Taking notes in meetings Taking notes at conferences. Mathematical modelling. Preparing material for presentations. As a replacement for a traditional laboratory notebook. Taking notes from papers. Writing down data relating to simulations. Writing letters and emails Drafting proposals To work through a problem in supervisions. Writing down data during experiments in the lab. Checking calculations in a to-be-submitted paper. Occurrences 7 6 4 4 3 2 2 2 1 1 1 0 13 Drafting papers 0 As the following ordered list of issues shows, the problems with the handwriting recognition were ranked as most important by the majority due to the fact that it was a problem encountered by virtually everyone. The scrawling and software conflict problems, though critical for those experiencing them, were only a problem for a minority of participants. Rank 1 2 3 4 5 6 7 8 8 9 10 10 10 11 Issue Handwriting recognition limited/faulty Time spent organising after upload Spidering/scrawling problem Handwriting recognition: scientific shortforms/greek not recognised Upload time Handwriting recognition: 'is' converted to 13 Handwriting recognition: dates not recognised Bulky Capture frequency too low Pen going to sleep if cap left off leading to loss of notes. Beeping/buzzing Capture resolution too low CPU Utilisation Problem Cost/environmental issues Score 22 21 20 19 18 16 15 14 14 13 11 11 11 10 The following table gives a list of requirements for future development, in order of perceived importance. Ability to search came out top, with the ability to do more sophisticated post-processing next on the list. Ability to annotate documents was also a requirement very near the top of the list. Interestingly, one of the features we expected users to want most, the provision of a form of shared online repository for the digital notes, came out relatively low in the priority list. This finding points to a low level of interest in the integration of this technology in a collaborative virtual research environment, at least within this cross-section of the heart and cancer modelling community. Rank 1 2 3 4 4 5 6 7 7 8 8 9 9 10 Requirement Ability to search digital notes. Tags to indicate which pages are saved as a single file. Ability to print out documents on digital paper and annotate. Customizable tags Equation recognition and conversion to LaTeX. Support for Mac/Linux Tags to exclude material from handwriting recognition Diary/schedule More sophisticated options for cutting and pasting digital notes (in addition to rectangle, freehand and timeline) Tags to exclude material from upload. Timestamped notes to tie to a particular dataset. Reference management facilities e.g. reference auto-completion, auto-generation of BibTeX/Endnote databases Tying printouts to digital notes (simulate stapling action) Pressure sensitivity - change colour according to how hard you press. Score 24 22 20 19 19 18 17 16 16 15 15 14 14 13 10 11 12 13 14 14 Tags to indicate privacy of digital notes. Ability to write over a powerpoint when presenting it. Online repository to share digital notes Other coloured inks (in addition to blue and black) Ability to record audio notes and associate with a page. Pencil version 13 12 11 10 7 7 Paradoxically, when users were asked in the survey, 'Who, if anyone, do you intend to share the material captured by the pen with?', willingness to share with others was indicated in six out of ten cases. Presumably, these users were happy with the use of email or IM to achieve this same objective rather than through more tailored online repositories. Rank 1 2 3 4 9 Requirement Close collaborators/friends No-one - the material is completely private to me. Other - Students I teach, supervisor, collaborators General public Occurrences 5 4 1 0 Conclusions and Recommendations 1. Anoto Functionality can bring significant benefits to the researcher who uses paper as a fundamental component of their research practice, for example, by providing a backup, facilitating sharing, making material easier to locate, and offering opportunities for capturing information digitally in situations where a tablet or PDA is unavailable or cumbersome. 2. Although the sample community in this study expressed little desire for a structured online repository for digital notes, experiences reported by other projects, such as the evaluation at CERN, have shown this concept to be highly relevant in situations where there are already well established and justified practices of regularly sharing paper-based material amongst collaborators. 3. The significant interest in the possibility of converting handwritten equations to LaTeX (a mathematical typesetting language) appears to merit further research in this area, building on work such as PenMath at the Ontario Research Centre for Computer Algebra. 4. If handwriting recognition is to be offered to the scientific community as a key value proposition in future products, further work is required to make it more reliable for scientific text. Support for Greek and common short forms needs to be supported by default, as well as simple table and column recognition. There also needs to be a straightforward way to tag sections to be excluded from this processing. 5. Special tags to associated pages together, a document annotation solution, a Mac/Linux version, and integrated search were the requirements deemed most important by evaluators and as such should be seriously considered by those developing tailored solutions for this community in future. 6. In the scientific community there are well established practices of building and exploiting home-grown software to meet highly particular needs not catered for by commercial suppliers. We would strongly encourage Anoto and its partners to publish their file formats so that those in the scientific community can build on the technology, making it more useful and facilitate a wider take-up. 7. To encourage a wider adoption, Anoto technology suppliers should consider allowing end-users to print out their own paper, and to implement greater interoperability between paper formats produced by different suppliers. Allowing end users to print out their own paper on scrap could alleviate some of the environmental/cost concerns associated with the use of the special paper. 10 References [1] Integrative Biology VRE Project, http://www.vre.ox.ac.uk/ibvre/ [2] Integrative Biology project, http://www.integrativebiology.ac.uk/ [3] IBVRE Initial Analysis Report, http://www.vre.ox.ac.uk/ibvre/IBVRE%20Initial%20Analysis%20Report.pdf [4] Memfem, http://virtualheart.bmen.tulane.edu/pmwiki/pmwiki.php/Main/LabSoftware [5] CMISS, http://www.bioeng.auckland.ac.nz/cmiss/cmiss.php [6] McCormack,J.B., Morrow, R.K., Bare, H.F., Burns R.J., Rasmussen J.L. 1991. The complementary roles of laboratory notebooks and laboratory reports, IEEE Trans Education, vol. 34. 133-137. [7] Paul Luff, Christian Heath, Moira C. Norrie, Beat Signer, Peter Herdman: Only touching the surface: creating affinities between digital content and paper. CSCW 2004: 523-53 [8] schraefel, m.c., ., Hughes, G., Mills, H, Smith, G., and Frey, J. 2003. Breaking the Book: Translating the Chemistry lab Book into a Pervasive Computing lab Environment. Proceedings of CHI 2004 Vienna Austria [9] Anoto Functionality, http://www.anotofunctionality.com/ [10] Nokia Digital Writing Solution, http://www.nokia.co.uk/ [11] Logitech IO2 Digital Writing System, http://www.logitech.com/ [12] Gaver, B., Dunne, T., Pacenti, E. 1999 Design: Cultural Probes. Interactions Vol 6, Issue 1. 21-29. [13] Brown, B., Sellen, A.J., and Ohara, K. 1999. A Diary Study of Information Capture in Working Life. HP Labs Technical Report. [14] LaTeX, http://www.latex-project.org/ [15] Zanón. E. 2001 El Diario de Laboratorio en Diseño Electrónico y su uso en el Ámbito Universitario En: Mundo electrónico, , Nº 321, 60-6 [16] SmartTea, http://www.smarttea.org/ [17] OpenLogBook, http://openlogbook.sourceforge.net/overview/index.html [18] The ATHENA experiment, CERN, http://athena.web.cern.ch/athena/ [19] ATHENA tests the digital logbook, CERN Courier, Volume 45, Number 1, http://cerncourier.com/main/article/45/1/13 [20] PenMath, Pen-Based Interfaces for Mathematics, Ontario Research Centre for Computer Algebra, http://www.orcca.on.ca/PenMath/ [21] SurveyMonkey, http://www.surveymonkey.com/ 11 Acknowledgements The Integrative Biology VRE project, led by the University of Oxford, was funded by the JISC Virtual Research Environment programme from April 2005 to March 2007. It involved three departments at the University of Oxford: the Computing Services (through the Research Technologies Service), the Computing Laboratory, and the Oxford e-Science Centre (now the Oxford e-Research Centre). It also involved members of the e-Science Centre at the Council for the Central Laboratory of the Research Councils (CCLRC). We would like to thank all those who participated in the evaluation: Carina Edwards, Jon Chapman, Gil Bub, Alex Rowley, Alex Walter, Gary Mirams, Inge van Leeuwen, Helen Byrne, Blanca Rodriguez, Sharon Lloyd, Rebecca Rowland, and Alan Garny. We would also like to thank Paul Luff, Jeremy Leighton John, François Grey, and Susheel Varma for their valuable input to the workshop on digital paper, and for commenting on previous drafts of this evaluation report.