Nugget Investigation Plans

This entry has been jointly authored by Peter Tolmie and Paul Wilson

A key part of the motivation for keeping texts is that they contain information – nuggets – that have some kind of future value. It’s rare for a whole document to be seen as having such value in its entirety. Nuggets are more often certain sentences or paragraphs. The question is, what happens to that value over the lifespan of an archive. Does the value of specific nuggets persist? Does it change? Does it grow or reduce in relevance? Does it become eroded to the point of obsoletion? And, given the same documents at some later point in time, would the same nuggets be identified, or would something else stand out as being more important?

To assess the use and impact of identifying nuggets, three separate investigations will be conducted using documents containing nuggets that were collected over the last 38 years. In all cases, investigations will be undertaken with reference to the individual who originally identified the nuggets. Two of the investigations will focus on the individual reflecting himself upon his use of nuggets. The third investigation will focus upon extracting information about the nuggets and their use through discussion.

The investigations will be performed using two separate sets of material:

Set 1: A set of 19 MindMaps relating to esoteric books.

Set 2: Documents from the PAWDOC collection.

Approach

Investigation 1: The first investigation will be a written exercise using the set of Mindmaps and will attempt to assess:

a) Whether individual nuggets can still be recalled;
b) Whether the sources of those individual nuggets can still be recalled;
c) How significant specific nuggets are considered to be by the individual concerned;
d) What other nuggets, if any, are associated with the one in question;
e) What specific concept(s) individual nuggets are believed to have contributed towards.

Investigation 2: The second investigation will also be a written exercise and will explore what nuggets, if any, can be identified by the individual concerned in unmarked versions of randomly selected documents from the PAWDOC collection where nuggets have previously been identified. From this exercise it is hoped to deduce:

i) Whether nuggets lose their vitality over time, with new concepts being derived and becoming established.
ii) Whether looking at the documents anew, within a distinct context, will lead to the identification of different nuggets with different relevance to the individual.

Note that the first 2 investigations are interleaved so as to be able to maximise the amount of discussion possible in the concluding interview.

Investigation 3: This investigation will be conducted as an interview and will explore similar themes to those tackled in the prior investigations but in a more open-ended way, so that the reasoning involved in retaining documents for the sake of specific features can be examined in greater detail. The interview will also seek to examine in detail topics that have spanned all three investigations.

Method

Investigation 1: Nineteen nuggets, each from separate MindMaps, will be randomly selected from the MindMaps of books on esoteric subjects. Randomisation will be achieved by creating a template that divides an A4 page into 24 numbered areas of equal size. A random number generator will then be used to create nineteen separate numbers ranging from 1 to 24. The Researcher, Peter Tolmie, will generate the numbers and use each one in conjunction with one of the MindMaps and the template to identify the nugget(s) present in that location. If more than one nugget occurs in that location, the topmost one will be selected.

The Researcher will assemble the nineteen nuggets and send them to the PAWDOC Owner and Subject, Paul Wilson, who will be asked to provide a written answer to the following questions in relation to each one:

  • Do you remember where this nugget came from?
  • Do you remember why you might have marked this out as a nugget?
  • How important do you consider this nugget to be now?
  • Do you remember what other nuggets were associated with this one?
  • How did you use this nugget and what other things did you develop on the back of it?

The Subject will then be sent the original MindMaps in which the nuggets appeared, and will be asked the following questions for each one:

  • Do you remember more about why you marked this as a nugget now?
  • Now you see the MindMap it came from and the other nuggets it was associated with, do you see it as more or less important?
  • Do you remember anything more now about how you used it or what other ideas it may have contributed to?

The Subject will return his responses to the Researcher, who will categorise them and place the results in an analysis spreadsheet. The overall analysis and specific instances regarding the Subject’s responses and reactions will be written up as the findings for Investigation 1.

Investigation 2: Nineteen separate documents, each containing nuggets, will be randomly selected from documents included in the PAWDOC collection between 1981 and 2011. Randomisation will be achieved by using a random number generator to create numbers relating to the 16925 entries in the PAWDOC Index sequenced in the order in which they were created.  The Researcher will generate the numbers and use his copy of the Excel version of the Index to identify the first nineteen of the entries for which an electronic file exists (some entries just contain the information they relate to and have no associated documents – these are identifiable by the contents of the Movement Status field); and for which the associated electronic files are likely to contain highlighted nuggets (items such as, for example, Health & Safety booklets, are unlikely to contain highlighted nuggets). He will send the list of numbers to the Subject who will open the folder labelled with each particular number, take a copy of the first file that appears in each folder and will send the files back to the Researcher. The Subject will take as little notice of the file titles as possible by setting up the Windows File Explorer window to display only the beginning of the file name so that only the Reference Number, which is always at the start of the file name, is visible. Should the Researcher deem any of these files to be unsuitable, he will send additional numbers to the Subject until a satisfactory set of nineteen documents that contain nuggets has been obtained. Then, using a cropping tool or by obtaining a clean copy of the document from elsewhere, he will send clean unmarked copies back to the Subject. The Subject will read the documents, mark up any text that he considers to be nuggets, and will send the documents back to the Researcher.

The Researcher will then make a comparison between the original nuggets identified and the new nuggets identified and record the results in the analysis spreadsheet. The overall analysis will be written up as the findings for Investigation 2.

Investigation 3: Nineteen nuggets, each from a separate document, will be randomly selected from randomly selected documents placed in the PAWDOC collection between 1981 and 2011. Randomisation will be achieved by using the same procedure employed in Investigation 2.  The Researcher will take the first nineteen suitable documents that contain nuggets and randomly select one specific nugget from each document using a random number generator. The nineteen nuggets will be assembled together and presented to the Subject who will be asked to answer a similar set of questions to the first set of questions in Investigation 1. These questions will form the rough frame for the first part of an interview in which the various nuggets will be discussed.

The Subject will then be shown the original documents in which the nuggets appeared, and will be asked the following questions for each one, which will form the second part of the interview mentioned above:

  • Do you remember more about why you marked this as a nugget now?
  • Now you see the document it came from and the other nuggets it was associated with, do you see it as more or less important?
  • Do you remember anything more now about how you used it or what other ideas it may have contributed to?
  • Are there things in the original document that you didn’t mark as a nugget at the time that you would mark as a nugget now? If so, why?

A third and final part of the interview will explore the responses from across all of the assessments in a more open-ended fashion to generate deeper insights and discussion.

The Researcher will then transcribe the interview, categorise the responses and place the results in the analysis spreadsheet. The overall analysis and specific instances regarding the Subject’s responses and reactions will be written up as the findings for Investigation 3.

Conclusions: The Researcher will use the findings from all three Investigations to write the overall conclusions of the investigation.

PAWDOC: Indexing (and the Accession process)

Indexing is a key aspect of a filing system because it is one of the mechanisms used to find documents. An alternative mechanism is to search the full text of documents, but the PAWDOC system has not explored this approach because a) the PAWDOC system does not recognise the full text of all the documents it contains, and b) because I have always believed this approach would provide too many hits thereby reducing search effectiveness.

Indexing within the PAWDOC collection is performed at two levels: first, each entry (uniquely identified by a Reference Number) in the main index file contains a Title field (in which a free format description and any number of uncontrolled keywords can be specified) and a few other fields including publication date. Second, each electronic file (be it a scan of a hardcopy document, or a born-digital document) associated with a particular Reference Number contains a unique file title. The contents of these two sets of indexing information provide the means to search for a document, to decide if you have found what you are looking for, and to get a better idea of what the chosen Reference Number and/or associated electronic file(s) contain. These potential uses are worth bearing in mind when indexing information is being specified.

Inevitably mistakes are made when specifying Title information and so it is essential that index entries can be changed over time to correct typos, grammar and factual errors as necessary. Changes may also be inspired by experiencing difficulty in finding a document, and, in these cases, some additional text may be included to reflect the search terms that first came to mind, or to reflect the changes in terminology that occur regularly in language and in today’s fast moving technical and cultural environment.

Indexing is closely integrated into the process of including a new document in the PAWDOC collection, not least because adding a new file involves creating a file title which in turn includes a Reference Number and possibly some text from the associated Index entry. Hence, timings for how long indexing takes are embedded within the overall process of Accessioning (which includes scanning for hardcopy documents). For the PAWDOC collection, which uses a scanner acquired in 2012 and a laptop acquired in 2018, it takes approximately 2.5 – 4 minutes to accession a single colour A4 hardcopy page; and about 3.5 – 5 minutes for a 5 page double-sided colour hardcopy document. An electronic file of any type and number of pages takes approximately 2 – 3.5 minutes to accession. These are appreciable amounts of time for an overhead administration activity amidst a busy working day. Therefore, it is worth exploring any means of reducing these timings. The obvious way is to keep the number of index fields to an absolute minimum. Experience with the PAWDOC system indicates that integrating keywords into the Title field (as opposed to having a separate keyword field) has been very successful. Conversely, the Publication Date field has not proved to be very useful and I believe could be dispensed with. Experience with the other fields used in the PAWDOC system are summarised below:

  • Reference Number – essential
  • Creation Date – a useful control
  • Movement Status – very useful for recording the whereabouts and status of documents
  • Date Last Accessed – only useful if you particularly need it.

Specific questions relating to this aspect are answered below. Note that the status of each answer will fall into one of the following 5 categories: Not Started, Ideas Formed, Experience Gained, Partially Answered, Fully Answered.

Q10. How long does it take to index a new document?

2001 Answer: Fully answered: It takes 1 -2 minutes to make an entry in the main index and to write the Reference Number on the document. A further 30 seconds is needed to create an entry in the document management system for either an electronic file or a document that is to be scanned immediately (Wilson1995c: 1). Scanning a document will add a further 40-50 seconds for a single page – though the per-page time is reduced considerably for multi-page documents that are put through the sheet feeder (Wilson 1997: 3).

2019 Answer: Fully Answered: This question should really be ‘How long does it take to index a new document and include it in the collection?’ because the indexing process is closely integrated with the process of saving the file. The figures in the 2001 Answer for making a new entry in the main index and writing the Reference Number on the document (1-2 minutes), and for creating a new entry in the document management system (an extra 30 seconds) still stand.  Scanning a document using the Canon DR-2020U scanner I bought in 2013 and the new Chillblast laptop I bought in 2018 with a 1 Terabyte solid state drive, takes 25 seconds for one colour page or 70 seconds for 5 double sided colour pages. However, the Document Management system was eliminated from the PAWDOC architecture in 2018 and replaced by Windows folders, so the subsequent set of actions has changed to the following: a) Create the main index entry (1-2 mins); b) Create a sub-folder within the main PAWDOC folder in the Windows file system, with the new Reference Number and copy the Ref No (15 seconds); c) create the file title (Ref No   Title,  Date scanned) in the scanner control application starting by pasting in the copied Reference Number (15 – 30 seconds); d) Save the file – use the scanner control application to select the new folder just created to specify where the file should be stored and press SAVE (20-30 seconds); e) check the file – open the file to check that it has been stored in the correct place with the correct title (15 seconds); f) Check the folder – open the file to check it has been scanned correctly (only for multi-page docs or for docs where there may be an issue) (15 seconds). The process for electronic documents has also changed as follows: i) create the main index entry (1-2 mins), ii) create a sub-folder within the main PAWDOC folder in the Windows file system, with the new Reference Number and copy the number (15 seconds); iii) Create the file title – open the SAVE AS dialogue box and give the file a new file name (Ref No   Title,  Date saved) by first pasting in the copied Reference Number (15 – 30 seconds); iv) Save the file – navigate to the folder into which the file is to be saved and press SAVE (20 – 30 seconds); v) Check the file – open the file to check that it has been stored in the correct place with the correct title (15 seconds). Of course, electronic documents which are placed into an existing Ref No have no need to create a new index entry or a new folder.

In summary, overall timings for including new documents in the collection are as follows: for a 5 page double-sided hardcopy document using a new Ref No – approximately 3.5 – 5 minutes; this may reduce to about 2.5 – 3 minutes if an existing Ref No is being used. For an equivalent 5 page electronic document using a new Ref No – approx 2 – 3.5 mins; this may reduce to 1 -1.5 mins if an existing Ref No is being used. Timings for a hardcopy document with only 1 page of contents are reduced by about 45 seconds due to reduced scanning time; electronic document timings stay the same regardless of page count.

Q11. What can you do to speed up indexing?

2001 Answer: Ideas formed: Keep the number of index fields to a minimum.

2019 Answer: Fully Answered: Minimise the number of fields. Automate the generation of the Creation Date. Eliminate the Date Last Accessed field if there is no good reason for recording that information.  To speed up the overall accession process, a) store more rather than less documents in existing Reference Nos (using existing Ref Nos avoids having to create new index entries); b) use a faster scanner; c) reduce the time it takes application programs to page through 17,000+ sub-folder names in the SAVE AS function (for some reason, in my Windows 10 system, the application programs take far longer to do this than Windows File Explorer).

Q12. What is the most effective set of index fields?

2001 Answer: Experience gained: The smallest number you can manage with (Wilson 1990: 95). Reference number, title and keywords, date of creation of the record, movement status, date last accessed.

2019 Answer: Fully Answered: Keeping the number of fields to the absolute smallest number you think you can manage with will minimise the time and effort spent on putting new items into the system and on managing the system. The minimum number of fields I could manage with are Reference No, Title, Creation Date, and Movement Status. The other two fields that I use, but which I believe I could do without, are Publication Date (I rarely refer to this) and Date Last Accessed (which was included mainly for research purposes).

Q13. What criteria should be employed when defining titles and keywords?

2001 Answer: Ideas formed:

  • Remember there is no need to use the actual title of the document.
  • Define titles/keywords in accordance with what the document means to you – that way you are more likely to be able to retrieve it.
  • Bear in mind that title/keywords serve multiple purposes (Wilson 1992a: 8):
    • To enable search and retrieval
    • To enable the user to decide if a retrieved record is what was being searched for
    • To provide the user with an understanding of what is contained in the item referenced by a retrieved record.

2019 Answer: Fully Answered: Do not feel constrained to use the actual title of the document – you are far more likely to be able to specify successful searches for items if you define Titles and Keywords which convey what the document means to you. If you are aware terminology is changing, use the latest terms you know as you are likely to become less familiar with the old terminology as time passes. Since there are no length constraints on the Index entry, use as many words as necessary to describe what the document is about. The second level of indexing – the text in the Titles of individual files – is constrained by length and for that reason might well be shorter than the main index entry. However, care should be taken not to truncate unnecessarily and not to just take the quick and easy route to create the file title by cutting and pasting a generic part of the Index entry. Certainly, where there are two or more files associated with the same Reference No, the titles of the 2nd and subsequent files should clearly distinguish them from all the other files in that Reference No. Also, bear in mind that Title fields (which include Keywords) and File Titles serve at least the following three purposes: a) to enable search and retrieval; b) to enable the user to decide if a retrieved record is what was being searched for; c) to provide the user with an understanding of what is contained in the item referenced by a retrieved record.

Q14. Are there circumstances in which keywords and titles should be changed over time?

2001 Answer: Experience gained: It is very necessary to be able to alter keywords and titles for the following reasons:

  • To accommodate the user’s growing familiarity with a topic. For example, the term `OA Human Aspects’ became an abbreviation `OA HCI’ in later entries, so it needed to be placed in previous entries to ensure they would be retrieved if only `OA HCI’ was specified in the search term (Wilson 1992a: 22).
  • To accommodate changing language. For example, the term `DTI pilot’ became `DOI pilot’ when DTI changed its name (Wilson 1992a, 12).
  • To accommodate additional material being added to an existing hardcopy file, or being added to the associated Folder in the Document Management System.

2019 Answer: Fully Answered: Users should be able to change Titles and Keywords at will to correct mistakes or to try and improve the search success rate. Circumstances in which changes might be made for the latter reason include:

a) to accommodate the user’s growing familiarity with a topic. For example, the term `OA Human Aspects’ became an abbreviation `OA HCI’ in later entries, so it needed to be placed in previous entries to ensure they would be retrieved if only `OA HCI’ was specified in the search term;

b) to accommodate changing language. For example, the term `DTI pilot’ became `DOI pilot’ when DTI changed its name; and

c) to accommodate additional files being added to the Reference No.

PAWDOC: Deciding what to file

Most types of things can be managed in a digital filing system. Documents, brochures and books can be either scanned or photographed; electronic documents are, of course, already in the right form; and even physical artefacts such as, for example, art works, can be photographed.

Once it’s been decided to establish a filing system, the reasons why the items concerned are being kept, and what is going to be done with them, should be written down. This should help in defining the following three key points: i) how long items are to be kept for; ii) whether any physical originals are to be retained after they have been digitised; and iii) whether every item is to be filed or just a particular subset. Point iii) is important because, if only a subset is required, the amount of time and effort that will be spent on adding items to the filing system, and managing it, will be reduced. Against this time saving must be weighed the extra effort – and potential errors that may be made – in deciding whether each particular object is to be included or not. Conversely, collecting every instance of a type of an item makes life easy and requires no decision making.

If the filing system is to replace or augment an existing collection, a decision must be taken about whether to exclude the existing items, or to undertake ‘backfile conversion’ to include some or all of them in the digital collection. Backfile conversion is often a lengthy and tortuous process and can act as a drag on proceeding with the new system; therefore, it should be avoided at all costs. If, however, it is thought to be needed, the reasons for doing it should be clearly understood before proceeding.

Specific questions relating to this aspect are answered below. Note that the status of each answer will fall into one of the following 5 categories: Not Started, Ideas Formed, Experience Gained, Partially Answered, Fully Answered.

Q5. What types of information and media can be managed by an electronic filing system?

2001 Answer: Fully answered: Just about anything can be managed by an electronic filing system including single sheets and stapled sets of hardcopy documents, scientific journals, volumes of bound material, ring binders, books, 35 mm slides, videos, electronic files and e-mail messages.

2019 Answer: Fully answered: Any kind of item can be managed by an EFS – though it may be necessary to also keep the original physical artefact to experience the full impact of certain types of items such as specially folded brochures, or significant documents with a tactile presence. Types of items that I have succeeded in managing in the PAWDOC collection include hardcopy documents, ring binders, books, journal papers, magazines, brochures, overhead slides, flipchart pages, CAD/CAM drawings, photos, videos, web pages, and email messages.

Q6. Can all types of electronic files be managed in an electronic filing system?

2001 Answer: Partially answered: Yes, certainly all types of application files can be. I have not stored executable files yet but have been advised that it can be done and how to do it.

2019 Answer: Fully Answered: Any types of file format – including executables – can be managed in this way. For example, file types in the PAWDOC system include Word, Excel, Powerpoint, PDF, TIFF, JPG, MP4, MPP, ZIP, and HTML. However, the real issue is whether you will be able to open them in future years if the particular application programmes are lost or not kept up to date. For this reason, it may be worthwhile converting some files to more common formats, such as PDF, before storing them. In the future, these so-called ‘Digital Preservation’ issues may get addressed automatically by AI programmes.

Q7. What criteria should be employed when deciding what to file?

2001 Answer: Not started

2019 Answer: Fully Answered: Individuals should file as much or as little information as they think they will need in their jobs – otherwise they will not have the motivation to spend the time introducing new material and managing the filing system. If a robust, all embracing, file facility is available within a particular system (such as in many electronic mail systems) such systems should be used to their full capability (i.e. retain everything up to any limitations set by the system administrator). Inevitably, whatever an individual decides to file, there will always be some material which is never subsequently accessed, and there will always be some required information that can’t be found in the system. This seems to be a regularly occurring phenomenon and should not deter individuals from filing – though they may influence the filing criteria that an individual applies over time.

Q8. Is it worth doing backfile conversion?

2001 Answer: Experience gained: Backfile conversion is certainly not essential, and is probably not worth doing, except for material you definitely want to keep for many years.

2019 Answer: Fully Answered: Backfile conversion usually takes a huge amount of time and effort, so a detailed analysis should be undertaken before deciding whether to do it or not. Alternative approaches to dealing with the old items should be investigated. Backfile conversion should only be carried out if there are compelling reasons for doing so.

Q9. What are the major considerations when carrying out backfile conversion?

2001 Answer: Experience gained: Backfile conversion requires a huge amount of time and effort and is not to be undertaken lightly.

2019 Answer: Fully Answered: Considerations to be taken into account when conducting backfile conversion include:

  1. Ensure the physical backfile material does not interfere with any space required by the replacement filing system.
  2. Decide if any of the backfiles being converted will need to be retained in their original form and, if so, define clear criteria for selecting which ones to keep.
  3. Decide what quality the converted backfiles need to be and then put in place the appropriate equipment, software and procedures to achieve that quality.
  4. Decide how users will be informed of which backfiles have been converted and which are yet to be converted.
  5. Establish a clear and doable schedule for the conversion process.
  6. Put in place motivational aids for sticking to the schedule. For example, setting a target of a certain number of conversions each day; and creating a visible progress sheet on which achievements can be ticked off.

PAWDOC: Collection content, size, growth rate and usage patterns

The PAWDOC collection was set up in 1981 to explore the application of office technology. In 2001 a paper was published in the Journal Behaviour & Information Technology called ’20 years in the life of a long term empirical personal electronic filing study’. This described PAWDOC and summarised findings about its use up to that point under the following 15 headings:

Two extra considerations are ‘Architecture‘ and ‘Requirements and Objectives‘. Now, a further 18 years on, the findings will be further reviewed in this and subsequent posts. The first of these follows below.

Collection content, size, growth rate and usage patterns

The precursor to PAWDOC was a conventional filing system in an upright cabinet using hanging folders and crystal tabs to store documents only (not journals, books etc.). The contents were specified in an index taxonomy with entries of the type, for example, 1.6.8 – Quarterly Progress Reports, which I was constantly adding to. Consequently much space was taken up in the cabinet by folders with crystal tabs housing only a few pages.  This changed after the PAWDOC system came into use: it needed far fewer hanging folders because each folder was filled to capacity with as many of the serially numbered documents that it could take.

The PAWDOC schema was explicitly defined to support the management of multiple sets of different material owned by multiple different owners. This capability was used to manage a variety of sets of my own material in addition to documents, for example, several different journals, 35mm slides (for making presentations), ring binders, and books; and, in the early years of the system, material owned by other people and organisations – though, as time went by, I made fewer and fewer such entries not least because of the uncertainty of being able to access such material.

The system soon became an integral part of my working life, and I used it just about every day. Several hundred new items were being added each year and this steady rate of acquisition soon ran up against the physical limits of the upright cabinet, so it became necessary to archive material in boxes and then to put some of the boxes in store. This went on until I started digitising newly acquired documents in 1996 and disposing of most originals. At this point, I also started to store born digital documents regardless of the applications they were created in.

In 2001 I started a new job in Bid Management which precluded personal storage of its associated highly confidential and fast moving documentation; so my useage of the PAWDOC system did reduce from then onwards. Nevertheless I continued to use it regularly on most days, and was still adding over 200 new items each year up to when I retired in 2012.

The 45 boxes of hardcopy that I had acquired eventually came out of store around 2001 and were stored in my garden shed. Their number was overwhelming and I began to doubt I would ever get them all digitised. However I stuck at the task, sometimes going at it solidly for several days at a time when my wife was away. By the time I retired in 2012 only 4 boxes remained and these had all been scanned by 2014: it was a great relief. The huge amount of physical space taken up by the collection had been reduced to just two archive boxes of significant hardcopy documents; and, after I had conducted a digital preservation exercise on the collection in 2018, the digital footprint of the collection amounted to some 115Gb. I have ended up with a fairly complete digitised archive of all the non-highly confidential materials that I had encountered throughout my working life, including substantial amounts of material from my earlier career from 1972 with Kodak and then CPC. Since retiring I’m continuing to add a few documents (around 40 up to 2019) in three categories: significant articles relating to the work I used to do; documents relating to the digital preservation of the PAWDOC collection; and material relating to work I am doing to investigate and document the findings from the PAWDOC collection’s 38 years of existence.

Specific questions relating to this aspect are answered below. Note that the status of each answer will fall into one of the following 5 categories: Not Started, Ideas Formed, Experience Gained, Partially Answered, Fully Answered.

Q1. What are the contents of the collection?

2001 Answer: Fully answered: At the beginning of July 2001, the collection consisted of 14,100 index entries representing approximately 185,000 pages of paper, 50,000 scanned pages, over 30 scientific journals (including Behaviour & Information Technology from 1982), around 30 books and conference proceedings, 3700 MS Word files, 400 MS Excel files, 250 MS PowerPoint files, 150 other electronic files of various types, and 10 CDs.

2019 Answer: Fully answered: The PAWDOC user Guide created in 2018 says: “All types of documents were stored including letters, internal memos, circulars, reports, specifications, minutes, overhead slides, 35mm slides, notes, training materials, brochures, manuals, maps, emails, computer magazines, journal articles, conference proceedings, and videos. As Office Technology became more versatile, electronic documents such as word processor files, spreadsheets, presentations and web sites were also filed.”. In June 2019, the collection consisted of 17,293 Index entries representing 29,610 electronic files in 16,067 Windows folders, and about 340 physical hardcopy documents in two archive boxes. A further 384 old electronic backup files are also stored in a separate folder. The The checking exercise conducted in 2016 identified the following numbers of different types of files in the collection: Word – 6380; Powerpoint – 466; Excel – 625; HTML – 382; Help – 90; Zip – 92;  11 other apps – 88; Scanned documents – 28,418. The collection was primarily a work collection and therefore the number of new items being included reduced to a trickle when I retired in 2012.

Q2. How much space does the collection take up?

2001 Answer: Fully answered: The paper takes up about 4.7 sq. metres of floor space and 1.7 metres of shelf space. The scanned images and electronic files take up 2.9 GB. The scanner, magneto-optical drive and CD Writer take up about 0.25 sq metres of desk space. The Filemaker Pro index is about 8.1 MB in size and the FISH data file is 11.2 MB. The Filemaker Pro, FISH, SQL Anywhere and Easy CD Creator software packages take up approximately 38 Mb.

2019 Answer: Fully answered: The two archive boxes stand one upon the other and take up 0.2 sq m of floor space. The laptop in which the electronic files reside takes up 0.08 sq m of desk space. The electronic files of the main collection take up 45.9 Gb storage space; and the backup files take up 66.6 Gb. The Filemaker Pro software used for the index takes up 336 Mb of file space.

Q3. What is the growth rate of the collection?

2001 Answer: Partially answered: Between 1981 and 1993 an average of 543 index entries were created each year – estimated to consist of an average 29.4 pages per day (Chan 1993:25).Over the whole 20 years life of the system, the growth rate has been an average of 705 entries per year with a range of 210 ± 1202new entries a year. In 1993, it was estimated that the collection was increasing at the rate of 3.8 MB per day.

2019 Answer: Fully answered: The growth rate of the collection is shown in the chart below.

There are three distinct phases: 1981 – 2000; 2001 – 2011 when I was working on highly confidential bids; and 2012 – 2019 when I was retired. The average growth rates per year during these periods were:

  • 1981-2000 – 696
  •  2001-2011 – 272
  • 2012-2019 – 46

Q4. How often are the contents accessed?

2001 Answer: Experience gained: Between 1987 and 1993, an average of 363 records were being accessed each year (Chan 1993: 25)

2019 Answer: Experience gained: The only data that has been collected on this question is in the date last accessed field, and unfortunately that only records the latest date an item was accessed – there may have been any number of earlier accesses. Furthermore, some items may have been accessed without the date last accessed field being updated. Having said that, 4,551 items have an entry in the date last accessed field, implying that 12,742 items have never been looked at for work purposes after they had been included in the collection (the date last accessed field was never updated when items were looked at for the purposes of controlling and writing about the collection). For the period from 2001 onwards, when I moved jobs into Bid Management, there were 751 index items with dates of 2001 or later in the date last accessed field (only 15 of these were in 2012 and only a further 15 of these were from 2013 onwards).

NB. The various references in the texts above to Chan,1993 relate to the following reference at the end of the 2001 paper in Behaviour & Information Technology:

CHAN, S. C. 1993, Feasibility of Paperless Office, Submitted in partial fulfilment of the requirement for the degree of MSc in Information Systems and Technology in The Information Science Department at City University, London (Supervisor: Dr David Bawden) [PAW/DOC/4012/08].

EitherOr

I wonder if any internet entrepreneur has come up with a web service to help with choices in present buying… I’m envisaging a website called EitherOr which enables people to outline a number of gift options, and possibly a ‘none of the above’ option, for the individuals they want to buy for. The recipients would be notified and be able to use links to review the items and to choose the option they want. The service would then purchase the item for them and have it sent to them. The ‘none of the above’ option, if provided, might enable the recipients to specify any item on the net up to the specified amount, for the service to purchase for them.

Nuggets about Nuggets across 38 years

To review what I’ve done on the topic of information nuggets, I’ve been trawling through the PAWDOC Index and files. The earliest example of sidelined text that I can find in my document collection was from October 1981 when I was working at the National Computing Centre. I can’t remember why I started to do it – but it may well have been prompted by the method that NCC’s Chief Editor, Geoff Simons, used to construct his books. He explained to me that he read everything about a subject, identified key points and put them on Post-it notes which he stuck on the wall. When he was ready to write, he rearranged the Post-its into separate sections and in sequence within the sections – and I saw examples of this in his office. At some point I started to employ this technique to construct the best practice books I wrote at NCC – but using the word processor on our new Zynar Office System to assemble and organise the key points. Sidelining text was an obvious way to identify key material to feed into that process.

Around 1994 I started talking with City University academics Clive Holtham and David Bawdon with a view to undertaking a joint project on ‘The Paperless Office Worker’. A key strand of this work would involve me digitising my PAWDOC collection. Extracts from emails between us in the early part of 1994 included the following:

Email from Clive Holtham: ‘….I don’t record each document as Paul does, but file at a quite detailed level. What I am conscious of is how much I forget about what I already have. The equivalent of underlining is important – we need to consider something more than keywords to store with each piece.’

Reply from Paul Wilson ‘…I agree with needing to deal with the underlining problem. I have sidebars on most of my material – they are the information nuggets; but I don’t know how much use they would be out of context.’

Email from Paul Wilson: ‘… with reports , papers etc. I usually mark the nuggets of info within them – presumably these are the bits of information I really want.’

The collaboration with City University proved very productive: Clive Holtham introduced me to a product Manager in Fujitsu who loaned me a scanner; and to the owner of a small company called DDS who loaned me the Paperclip document management software. Soon I was scanning my existing paper documents and new ones as they arrived. In January 1997 I issued my 3rd briefing note on these activities and included the following towards the end of the four page document:

Despite the close relationship between filing and information use, contemporary filing systems provide little other than title and index fields to support the knowledge acquisition, synthesis and use process. Filing systems, it seems, are there just to store items and to aid their retrieval. Unfortunately, the personal knowledge acquisition, synthesis and use process is not supported adequately outside filing systems either. Some standalone packages do exist, but they are not intended to be used in a day to day manner for personal knowledge acquired in documents, electronic files and other artefacts. In fact, even the need for such support is not widely recognised.

It is not yet clear to me what support could be most beneficial. However the clues are littered throughout the practices of knowledge workers like myself. For example, whenever I read articles, papers and reports I always mark the good bits – the nuggets of information. These are key points which I particularly want to augment the knowledge in my brain. Sometimes when I have been researching a topic I collect together all the nuggets I can find, categorise them and reorder them, and synthesise a new view of the topic in question. Unfortunately, like the magnesium nodules on the floors of the deep ocean, huge numbers of nuggets now litter my filing system unseen and inaccessible. I hope they are in my brain and that they have been used to develop my current state of thinking – but I’m not so sure that their huge potential has been fully exploited.

For my filing activities to really start adding value I need tools which can record those nuggets as I consume and index each item, and which can enable me to reorganise those nuggets, add more nuggets, and synthesise new nuggets, in the process of actively developing my ideas. Such tools would, of course, maintain the links to the original source material stored in my files. And my files would become a combination of original source material and the representations of my developing thoughts and ideas.

Now that I am confident that I have the paper scanning and electronic file indexing activities under reasonable control, it seems high time to start addressing the critical area of information use and its role in knowledge management.

These are the earliest mentions I can find of information nuggets in the PAWDOC collection – and they give no indication of where I picked up the concept from. In fact I can only find one published mention of the term and that was in the Lotus Notes-oriented magazine ‘Groupware and Communications Newsletter’ from April 1998 in which one Ted Howard-Jones gave a brief description of a service implemented by a major financial institution to capture competitive information. He wrote:

‘Called Report-It!, this service captures knowledge using a secure voice-mail system and delivers categorised information directly to the desktops of office-bound managers and competitive information professionals. These nuggets of professional information are disseminated via Notes.’

The use of the term Knowledge in this article, and in my briefing paper mentioned above, reflected the fact that, in the late 1980s, the term Knowledge Management started to became fashionable and by the late 1990s had become a holy grail of IT Professionals, Management Consultants, and Academics. The first mention of the term in the PAWDOC Index appears in 1990, and occurs in a further 137 Index entries from then to 2016. The company I worked for (Computer Sciences Corporation – CSC) was a global computer services organisation with tens of thousands of employees worldwide. Its involvement in the Knowledge Management topic came from three angles: first, its clients started to ask about it and how to do it; second, its consultants and salesmen saw it as a potential source of revenue; and third, its employees and management began to think that they needed it internally to improve the effectiveness of the business. Hence, as a consultant in the UK end of the business, I was aware of or got involved in:

  • A number of initiatives to develop an offering or information for clients, including:
    • the development of a KM service by CSC Netherlands in 1990;
    • discussions with CSC UK Management Consultants who were developing KM propositions, in 1996-8;
    • the definition and design of a KM service by CSC UK personnel to address opportunities in a number of clients including ICI Paints, LUCAS Engineering, John Menzies and United Distillers, in 1996-7;
    • the publication a CSC Research Services Foundation report on KM in 1998;
    • news of CSC’s Global Knowledge Management services, in 2001.
  • The development of such systems for clients (primarily using web pages on intranets), including:
    • a presentation to ICI Paints in 1992;
    • the development of systems for KM and for a web-based ‘Gazateer’ of all KM, architecture and other organisational information, for the Nokia SCC – a new organisation being set up by CSC UK for Nokia, in 1997;
    • the design and population of a web-based KM system for Dupont Agriculture’s architectural components, in 1998.
  • the development and use of internal CSC solutions, including:
    • attendance at internal CSC workshops on developing an organisational learning infrastructure in 1996;
    • the development of an improvement process for CSC’s new application development organisation that was being designed and built from scratch, in 1998;
    • the design of a practical KM programme for CSC UK’s reorganised Consulting & Systems Integration unit, in 1999;
    • knowledge management work being done for BAE, in 1999;
    • an internal Community of Interest on the subject of Personal Knowledge Management, in 2001.

Of course, my rather lowly personal filing perspective had to be rapidly expanded as I entered the Knowledge Management (KM) arena to accommodate both high level Management Consultancy notions of ‘Intellectual Capital’ and the distinction between Knowledge and Information; and the practical need to derive benefits from an investment in KM by effectively sharing the knowledge that had been acquired. Indeed, one of my contributions to the internal Community of Interest mentioned in the last bullet point above seems to herald a change in my thinking. My opening sentence reads:

‘Since collaborating with everyone in this shared space I’ve had my eyes opened to the concept that KM is all about enabling people to find things and work together, as opposed to the idea that KM is all about nailing down bits of knowledge and providing it to people. I realise that there is significant crossover between the two approaches – but nevertheless giving priority to one or the other will result in significantly different activities.’

During the 1990s I learnt a great deal about what people thought Knowledge Management was, and also about the powerful potential of the new web technology to support KM. However, throughout this period I don’t recall any specific conversations or documents about dealing with underlined, sidelined, or highlighted text. Indeed, by the time I completed the draft of the paper summarising my PAWDOC findings in June 2001, it seems that my ideas on the subject had advanced no further than that reported above. The paper was published in the journal Behaviour & Information Technology (BIT), and addressed the subject as follows in the section on ‘Areas of Investigation and Summary Findings’:

Q27. How can an electronic filing system be used to develop and use knowledge?

  • Include substantive information in the index entries, for example phone numbers, book references, and expense claim amounts.
  • Identify the nuggets of information (i.e. the valuable bits) when you first read a document
  • Capture and structure the nuggets into the overall nugget-base at the same time as indexing the item

Status: ideas formed

Q28. What is the best way to capture and structure information nuggets?

Probably by using a Concept Development tool. Some initial prototyping has been done using the Visual Concepts package and the eMindMaps package.

Status: ideas formed

Q29. Is it feasible and practical to capture and structure information nuggets as well as indexing items?

Status: not started

Q30. Is it worthwhile building and developing an information nugget base?

Status: not started

The Concept Development prototyping mentioned in the answer to Q28 above probably started in April 2001 when I acquired a free copy of the eMindMaps software, and by the end of 2001 I had started making MindMaps of books on esoteric topics such as the Egyptian pyramids and the origin of Atlantis. I have no record of my detailed intentions in doing this, but I guess I wanted to experience the process of recording all the nuggets I found in a book – and then to explore what could be done to integrate and exploit the material from several different MindMaps. In all, I made MindMaps of 19 books over a two year period; but that’s as far as I got. At the end of 2001 I started a new job in bid management and my energies were increasingly taken up with managing very intensive bids, with documenting the bid process, and with operating a Lessons Learned programme. I had no time to pursue these information nugget ideas any further.

This concludes my review of my previous activities in the use of information nuggets. The questions I posed in the 2001 BIT paper still remain largely unanswered, and the operation of the PAWDOC system has not provided any further insights on the subject since then. However, the existence of the sidelined documents, and of the 19 MindMaps, do provide an opportunity to undertake some rudimentary practical work to explore if the information nuggets identified were memorable and of any use. Subsequent entries will outline the methods that will be used to undertake these investigations, and will report on the results.

NBod1.0

I got to thinking in the shower the other day (probably prompted by the BBC’s Years and Years series) that we’ll be needing the concept of the NaturalBody sooner rather than later. NaturalBody people – NBod1.0 – have no piercings or permanent embellishments on the skin; have had no parts of the body deliberately removed, filled or cut (and that includes teeth and the sexual organs); and have not had surgery to alter their natural appearance. They have no permanently attached artificial physical appendages; and they don’t contain any embedded physical engineering equipment. NatBod1.0 people don’t have any embedded chips or other computer equipment, or software that connects them to any digital networks. NatBod1.0s have not had their DNA adjusted to enhance their or their offspring’s capabilities or appearance (the offspring will not be NBod1.0 as they will inherit their changed DNA status). NBod1.0s are simply people who are as they were born and who have developed naturally. They may have been born with disabilities, or have had accidents or illnesses – the notion of NBod includes no value judgements. In the western world there may be relatively few NBod1.0s (mainly babies and young people) as many people have had dental fillings and/or tattoos; but large numbers must still be up at the 0.8 or 0.9 levels. Going forward, however, we may need the concept of NBod to remind us of what humans are as we gain a growing capability to augment our bodies with technology. Even today, such a concept might help us make choices in the face of cultural and religious motivations to deface, cut, mutilate and remould our bodies.

Personal filing in a 40 year vortex of change

This entry has been jointly authored by Paul Wilson and Peter Tolmie

The PAWDOC filing system was set up in 1981 to try and understand how the newly emerging office technologies of that era might assist individuals to manage their office documents. Over its 35+ years of operation much has been learnt, and it is our intention to try and understand and describe those findings. However, the system was set up to address requirements in the office of the 1980s, and there has been a revolution in the way business operates since then. In order to be able to relate the findings to office work today and in the future, this entry explores the differences in  requirements for personal filing in the office between the early 1980s and 2019.

Perhaps the most significant difference is the transfer of huge amounts of information from paper-based documents to digital files. Note that this is not saying anything about the current volume of paper in the office (though we would suggest that there is probably less paper filed by individuals now than in the 1980s) – just that individuals now have to deal with huge amounts of electronic material in contrast to the early 1980s when they dealt with virtually none. This transition has been facilitated by a huge growth in the use of computer hardware – desktop computers, laptop computers and mobile phones – throughout the world, from a base of zero to near-universality.

The growth in the use of computers has also prompted huge changes in office work. At the beginning of the 1980s, office professionals used support staff – typing pools, secretaries, and admin support staff – to perform their administrative tasks.  However, as office technology became widespread, professionals started to do their own typing and support staff became a luxury which could be cut to reduce budgets. In 2019, only very senior management have secretaries and office workers are expected to be self-sufficient and fully competent in the use of all hardware and software relevant to the realisation of their work.

These changes were accompanied by a revolution in communications. Electronic mail has now almost entirely replaced internal memos and external letters, and has prompted massive increases in the amount and speed of communication. Email also rapidly became the key mechanism for supporting distributed teamwork – nationally and globally – and now underpins a battery of related interests, from the sharing of documents to the organisation of voice conference calls (which are the unsung foundation upon which much of business and government now operates). In more recent years, as mobile phones have permeated throughout the world’s populations, text messaging and chat applications have become an integral element of personal and business relationships. To this highly significant mix of new technologies must be added the recent massive uptake in Social Media. An unfortunate side-effect of the sheer effectiveness and pervasiveness of these mechanisms is high levels of information overload across a large proportion of office workers.

Importantly for the work to derive findings from the long-term operation of the PAWDOC filing system, the changes described above have impacted filing activities in the office. Hot desking and home working have made personal filing cabinets and bookshelves a luxury. The folder systems integral to computer operating systems (primarily from Microsoft and Apple) are now used to store the electronic documents created and received by the individual. At the same time, email systems have their own integral filing systems into which mail can be rapidly sorted and stored indefinitely in the cloud; text messages are stored on users’ mobile phones in the form of text streams by both senders and recipients; and Social Media systems have their own self-contained environments distributed across vast computing networks. The further evolution of cloud-based repositories, such as Dropbox and Google Drive has led to an added utilisation (if not trust) in distributed document stores. Even if users wanted to integrate these different collections, it would be almost impossible for them to do more than just copy selected elements from one to another or to a dedicated filing system: these stores are separate silos and will probably continue to be so for many years to come.

The design of the PAWDOC system in 1981 was based on an understanding of office filing requirements at the time. There was an expectation of how emerging office technology might be used to support those filing requirements, but little appreciation of how the technology itself would change the way business operates. Initially, then, learnings from the development of the PAWDOC system were entirely focused upon what the impact might be of new assumptions about filing built into the construction of computer systems in the early 1980s. Later on, in the middle period of PAWDOC operation, the findings speak to what it was taking to manage a filing system in a changing work environment populated by imperfect but maturing technologies. More recent findings give a somewhat different picture, as many of the troublesome technologies of the middle-era have come to be taken-for-granted resources, giving rise to new kinds of problem, of which information overload is but one potential symptom. What is clear at present is that computer technology and the business world is now changing so rapidly, the presumption present in the early days of PAWDOC – that one could readily identify needs and solutions for the future – now seems somewhat naïve (if still just as pressing).

One thing, however, we believe has remained constant and that is the attitude towards filing across the population. Most people are not motivated to put effort into filing because it is extra work for an indeterminate reward at some undetermined point in the future. A smaller subset of people is willing to put varying degrees of effort into the activity. We believe this has changed little between the early 1980s and the present day. As it happens, the PAWDOC owner was at the more extreme end of this latter group and wanted to file both effectively and comprehensively. Hence the PAWDOC collection contains most of the documents that the owner read and/or believed to be significant in his work; and consequently it should be borne in mind that the learnings derived from his experiences concern almost the worst case requirements of filing load and effort. It should be easier for most of the population. Certainly, it would seem easier, for digital copies are now retained of virtually everything as a matter of course. The extent to which that is oriented to as a personal collection of materials is a different matter, as is the probity of third parties hanging on to everything in that way. These, of course, are burning questions of the moment, and ones to which we shall ourselves return.

Knowledge Nugget Endeavours

In 1981 I was working in the newly formed Office Systems team in the UK National Computing Centre, and I was interested in how the new technology could support the management of an individual’s office documents. So, a colleague and I decided to experiment with our own documents. This was the start of a still-running practical exploration of how to manage personal documents using digital technology.

It was my practice to highlight key text with a side line as I read documents, and, as my document collection grew, I began to wonder how I could make explicit use of this very specific information. No doubt the act of highlighting was in itself helping me to assimilate documents; but I wasn’t sure if all the highlighted facts were being retained in my brain and being used to develop new concepts.

During the 1990s, the trendy new topic of Knowledge Management emerged which provided a recognised arena in which I was able to explore these ideas. Sometime during this period, I latched onto the term ‘nugget of information’ (the first published mention of this in my filing index is in an article by one Ted Howard-Jones in the March 1998 issue of the Groupware and Communications Newsletter). My attempts to relate lowly personal filing to the Knowledge Management field eventually fizzled out in the face of much sexier concepts such as an organisation’s ‘intellectual capital’. However, in the early 2000s, I did make a specific attempt to see if I could use Concept Mapping software to capture nuggets, by applying it to 19 new age books on the pyramids and the like; but that is where my knowledge nugget endeavours ended.

Now that I’m trying to find a home for my document collection, and to identify the findings from its long term operation, it seems a timely moment to review this particular aspect, to do some practical work on the nuggets I’ve identified over the years, and to draw some conclusions on the topic.

Doggie Tales

A question that keeps arising in the Order from Chaos investigations documented on this site is ‘why are things being kept?’ One answer is that an item reminds us of people or events that we want to remember; and remembering such things seems to be important for humans. The parable below hints at a reason why.

Doggie Tales – a parable about existence in our world

On platform 5 at Slough railway station there’s a glass box on the wall and it contains the stuffed remains of Station Jim, a beloved dog well known to passengers using the station in the 1890s. A plaque explains as follows:

“Dog Jim was first brought to Slough station when he was about three months old. He was like a ball of wool then, and could be carried about in an overcoat pocket. The first trick taught him was to get over the stairs of the footbridge, and he learnt it so well that he never once crossed the metals from the time he was brought here to the time of his death.

He started his duties as Canine Collector for the Great Western Railway Widows’ and Orphans’ Fund when he was about four months old but, because he was in bad health, he was only actually collecting about two years or so. Yet he managed to place about £40 to the account of the Fund. He only once had a piece of gold put in his box — a half sovereign. On several occasions half crowns were found, but the majority of the coins he collected were pennies and halfpennies. After a time he was taught to bark whenever he received a coin, which caused a great deal of amusement to his numerous patrons. One Sunday during the summer of 1896, a hospital parade was organised at Southall, and his trainer was asked to take him up there to collect. The result was that when his boxes were opened by the Treasurer 265 coins were in them. There were only about five pieces of silver, but when it is remembered that he barked for each coin given him, this must be regarded as a good afternoon’s work.

His railway journeys were few in number. On one occasion he went to Leamington; that was his longest ride. Another time he got into a train and went to Paddington, but was seen by one of the guards and promptly sent back again. Another day he got into a train and was taken into Windsor. The officials saw him, and wanted to put him in the next train home, but he would not agree to that, and walked back through Eton.

He knew a great many amusing tricks. He would sit up and beg, or lie down and “die”; he could make a bow when asked, or stand up on his hind legs. He would get up and sit in a chair and look quite at home with a pipe in his mouth and cap on his head. He would express his feelings in a very noisy manner when he heard any music. If anyone threw a lighted match or a piece of lighted paper on the ground he would extinguish it with a growl. If a ladder was placed against the wall he would climb it. He would play leap frog with the boys; he would escort them off the station if told to do so, but would never bite them. At a St. John Ambulance Examination held at this station he laid down on one of the stretchers and allowed himself to be bandaged up with the rest of the “injured”. He was a splendid swimmer and a very good house dog. He died suddenly in his harness on the platform on the evening of November 19th 1896, and was afterwards placed here by voluntary contributions from a number of the residents in Slough and the staff at this station.” [reproduced on 15May2019 from the Wikipedia entry for Slough Railway Station]

I first came across Station Jim when my office moved to a building opposite Slough Station in 1986 and I occasionally travelled up to London for meetings. There he was in his glass case, 90 years after his death, still intriguing passengers as they waited for their trains. He stuck in my mind, and although I haven’t visited Slough station for 25 years, he popped up in my head as I thought about writing this piece. I googled him and came up with his story straight away. In fact, a search for ‘Station Jim Slough’ produces some 685,000 hits (a search for just ‘Station Jim’ results in a misleadingly huge number of hits probably because a TV film based very loosely on the dog was made in 2001).

Wikipedia cites the Office of Rail and Road‘s statistics in saying that Slough railway station has over 4 million users every year; so it’s reasonable to suppose that, since the display was installed in the late 1890s, many millions of different people must have looked at Station Jim’s taxidermied remains and read about his life. His display and plaque bear testimony to his existence; and they continue to create and reinforce his memory in the minds of hundreds of thousands of people every year, just as his physical presence on the platforms did all those years ago.

Our beloved Cavalier King Charles spaniel, Alfie, died peacefully a week ago from a heart attack aged eleven. He was a loving dog and just wanted to be close to us all the time. Two days after he died we gave all his artefacts – baskets, bowls, food, etc to a dog charity with the one exception of Monkey, the first stuffed toy he ever had and his favourite throughout his life. Monkey has had a wash and now resides on the settee. Alfie was an integral part of our lives, and was also loved by other members of our family and friends. He inspired the concept of ‘Alfie time’ – 6pm – time for drinks and snacks. It was of course the crisps, pretzels, cheesy nibbles etc. that Alfie knew he was entitled to, that drove him to remind us every evening (sometimes with extravagant displays of crouching and turning and whimpering and short barks) when it was Alfie Time. Alfie Time became an established feature in our lives and will be forever thus for us and family and some friends.

We have many photos of Alfie in the indexed and labelled digital family photo collection and physical photo collection in albums, that I have painstakingly built up from all the photos and negatives that I could find in our house and my mother’s house. In fact, a search of the digital collection found 259 photos with Alfie in the file title. Many of these are also in the physical albums. From time to time we’ll look at these photos and they will bring back all our memories of Alfie, how he behaved around the house, how he was so pleased to see when we returned to the house, and all the good times we had with him when we took him away with us. We can’t be certain what will happen to our physical and digital photo collections after we are gone; but we would hope that our children and grandchildren would value this family archive enough to look after it and perhaps even look at it occasionally. If and when they do, they will find Alfie’s picture appearing constantly throughout those eleven years of his life together with descriptions in the file titles and album slip-in tabs of what he was doing and reflecting the close bond he had with us. As they look at those photos, the memories of those who knew Alfie will come flooding back. For our grandchildren, who only knew Alfie briefly up to when they were about 2 and 3 years old, the photos will bring meaning and tangibility to some traces in their minds. For those who come later, the images and words (should they survive the years) will create and reinforce memories of our plucky, loving dog.

When I was three, my mother and father took me to Singapore where my father had got a job as a shipping agent. Shortly after arriving there, we got a dog – a dachshund called Mandy. I have some clear memories of Mandy, and there are 8 photos of her in the family photo collection. I went home to boarding school when I was eight and sometime after that Mandy died. My mother tells me she died peacefully lying on the drive in the sun while she was out. She and I both remember Mandy with fondness. However, my father is dead and none of the rest of the family ever met Mandy, so really we are the only two who have a strong recollection of her time on this earth. Perhaps there are one or two old timers in their eighties and nineties who came to our house in Kheam Hock Rd and retain a fragmentary image in their minds of her – but they’re dying out fast. I mention Mandy to the family very occassionally and her photos might be stumbled on in the photo collection from time to time; but her presence in our collective minds is dimming as the years go by. Eventually there’ll be just those 8 photos and associated words in the file titles (should they survive the generations) that will bear testimony to the part that Mandy played in our lives at Kheam Hock Rd.

My wife’s family had a dog when she was little. He was called Bruce, and they lived in a house in Leeds. My wife remembers Bruce but rarely talks about him; and I have no recollection of conversations with her mother, brothers or sisters about Bruce – in fact, her two brothers had not even been born before Bruce died. I’ve found 2 pictures of Bruce in the family photo collection and, although I must have indexed them and created their file titles, I’d forgotten they were there or what they looked like. They have left new traces in my mind overlayed with the conversation I had with my wife about him yesterday morning. They are images and information of interest but they inspire no emotion in me; and I guess my wife feels the same about what I tell her about Mandy. Perhaps my wife will talk about Bruce to the family sometimes in the future, but, like Mandy, Bruce’s presence in this world will eventually fade to just those two black and white photos that may spark an interest in those who see them.

My mother’s parents got a dog – a wire-haired terrier – soon after we moved out to Singapore in 1953. We have just one photo of my grandfather and the dog taken in the mid 1950s in the garden in Old Retford Rd, Sheffield. My mother never met the dog and can’t remember what it was called or anything else about him – only that he was run over outside the house to the great mortification of her parents.

Earlier generations of our family probably – possibly – had dogs…. Who knows?