Nearly finished

For the last 4 years I’ve been tackling my parents lifetime photo collection which includes all the photos they have inherited from their forebears. It’s required a good deal of effort and persistence. If you intend to take on such a challenge you’ll need to be prepared for the long haul.

In order to find out what/who the photos are of and when they were taken I’ve had a discussion with my mother about most of them. Inevitably, she remembers less about some of the older ones (taken, say, 70 or 80 years ago) than some of the newer ones; and she can’t provide any light at all on some of those she inherited from her parents or from my father’s side of the family.  In those cases, I’ve used every clue I can find (type and size of print, numbers stamped on the back, writing on the back, similarity of scene, etc.) to identify related photos and deduce some information about their progeny. I’ve recorded a few of the conversations we’ve had about the photos; and deciding how to keep those recordings is one of the remaining tasks I have to do. However, apart from that and putting  a few remaining photos in albums, the job is just about finished. All 7000 or so photos have been indexed, scanned, and the file titles populated with reference number, contents and dates; and then the physical photos have been put into one of about 19 slip-in albums.

I’ve done similar jobs on my own pre-marriage collection, on my wife’s family collection, and on our own family collection. That is where the insights I’ve recorded earlier in this topic have come from. Now it remains for me to tie up loose ends and to decide whether to collect everything I’ve learned into a single article or not.

The eBox

In today’s digital age it is, perhaps, preferable to store paper documents by digitising them (as either scans or photos). However, sometimes one wants to retain the originals, and, in that case, the two forms have to be stored in separate places. It would be useful to be able to store them  both in a single container (an ebox perhaps) which is able to both store physical pages AND display electronic pages.

Digitisation in progress

Since my last entry I’ve been steadily digitising the remaining paper in my lifetime work document collection. These are documents I want to retain in original form (some of which have a comb binding), documents that need to be scanned in colour, or documents that were too large to go through the scanner. I acquired a better comb binding machine at the end of October, my current scanner has full colour capability, and I’ve found that photographing large items with my modern camera produces a perfectly readable on-screen image. So there’ve been no more obstacles to getting the job done. As each item is digitised and the file inserted into the FISH document management system, I’m checking the index entry and updating the Movement Status field with either OK or XX as described in my last entry.  At the current rate of progress I should finish the digitisation work by the end of January.

Case Study in DPC Report

Having completed my paper on ‘Preservation Planning for Personal Collections’ I sent it to the people mentioned in the Acknowledgements section to get their permission to include their names. In October, I received an offer from two of the people I had acknowledged – Neil Beagrie and William Kilbride – to make the paper available as a case note on the Digital Preservation Coalition’s (DPC) web site early in 2016. I was pleased to accept the DPC offer. Since then, the work I had done for the paper was included as a case study in the DPC Technology Watch Report on Personal Digital Archiving which was published on 15Dec2105. This is a very informative document with useful advice for individuals and I’m pleased to be a part of it.

I am now waiting to hear from the DPC about what changes to the paper its reviewers have suggested, and the timescale for the paper to be published in the DPC website. My hope is that, once it is published, I might hear from some people interested in collaborating in applying the process described in the paper to my work document collection.

Checking the Collection

Two of the remaining things to be done with my lifetime document collection are to:

a) scan the remaining paper (documents not yet scanned because they were labelled as artefacts to be retained in both their paper and electronic form); and

b) go through all the index entries making sure they contain valid information and that there is an equivalent scan in the Document Management System.

For a) some of the paper documents have comb bindings and will require a binding machine if they are to be scanned using a sheet feeder and then reassembled in the comb binding. I acquired a very cheap comb binding machine on ebay some three weeks ago (though, it seems it was false economy – it stopped functioning properly and I had to send it back yesterday…) and have made a start on scanning the remaining paper. I’m addressing b) in parallel, and recording any issues or key points I find using the following notation in the ‘Movement Status’ field:

OK = The Index entry is as complete as possible and there is an equivalent scanned version

XX = There is a serious issue with this item.

Should the index entry and scans be present but there are some points to be recorded about them,  the ‘OK’ notation is qualified within brackets as follows (multiple qualifications can be recorded within the brackets as necessary separated by  a comma):

  • OK(multi): one or more of the equivalent scanned files in the FISH Document Management System are in the form of multiple TIF files – one for each page. FISH obscures the fact that there is a separate file for each page – but that is how the scan is actually stored.
  • OK(n docs): This identifies when there is more than one scanned document associated with this index entry – where n is the number of separate documents (this is a feature of this approach to electronic filing – multiple documents can be stored under a single Index entry).
  • OK(poor): the quality of some or all of the scanned electronic pages is poor.
  • OK(dbl): one or more of the associated scanned files came from documents with double sided pages which have been scanned all of one side first and then the pile turned over and the other side scanned. When this has been done the scanned pages are out of order. This was done with the first two scanners I had which were not able to handle double sided pages.
  • OK( ord): the pages of one or more of the scanned files are out of order for a reason other than the ‘dbl’ reason above.
  • OK(left): the original document was deliberately left at the location of the employer concerned when I moved jobs.
  • OK(A5): one of the scanners I had was not able to handle A5 pages reliably and sometimes recorded a line as an image dragged down the page for an inch or more.

Should an XX notation be applied to an Index entry, the reason it is being noted as such is recorded in brackets with one or more of the following notations:

  • XX(lost): the paper document was lost before a scan could be taken, so the Index entry is the only trace left of this document.
  • XX(ref): The Reference No is duplicated or incorrect in some other way.
  • XX(pap): The document is still only in paper form because its form is such that it has not yet been possible to digitise it effectively.

The fact that such points and issues are present in the collection in noticeable numbers, simply reflects the fact that, when dealing with such large volumes of material in the course of performing busy jobs across many years, it is inevitable that things will go wrong and mistakes will be made. Having been through the whole of the index, I’ll have statistics about the overall prevalence of such issues in this particular collection.

Search Status

In my last entry I said I’d contacted six potential repositories for my lifetime document collection. This is where those communications are up to:

Loughborough University’s Centre for Information Management: My email was forwarded to the University Library which did not respond. I followed this up on 20th September with an email to the Director of Library Services, and am waiting for a reply.

Manchester University’s Computer Science Dept Research Office: My email was forwarded to a researcher with an interest in the history of computing, but that person replied saying that her work in that area had been put on hold. On 20Sep I used the University library general enquiry form to enquire if the library would be interested. The library advised me to contact the Head of School administration  in the School of Computer Science who I duly emailed on 2nd October, and I am awaiting a reply.

City University’s Cass Business School:  My contact said he would pass my email onto his colleagues and I have heard nothing further.

UCL’s Dept of Information Studies: My contact said she would look out for interested people at conferences.

The National Archives: I was advised by my contact to direct my question to the Archive Sector Development team which, while it does not have any direct provision for taking private collections, should be well placed to provide advice. I emailed the Development Team on 16th September and am waiting for a reply.

 The Science Museum Wroughton Library and Archives: The Library asked me a number of questions about the collection but finally responded by sayingThank you for allowing us the time to consider your collection which we have now discussed with the Archive Collections Manager, Science Museum’s Keeper of Technologies and Engineering, and Head of Library and Archives. We have concluded that whilst we find this a most interesting idea, we do not think that the content fits within our current collecting policy criteria. You may have already contacted them, but we suggest that the National Computing Collection might be a more appropriate repository for your collection.”

This is all pretty much as expected: I know its going to be hard to find a repository that’s interested. However, should there be no interest from any of the above organisations, I plan to rely on interest being generated by the publication of my paper on Digital Preservation Planning.

Personal Preservation Planning Paper

I see my last entry was in late June – a few weeks before moving house in mid-July. Since then, a full programme of packing, unpacking and house renovation has affected progress. However, I completed a paper describing my experiences in identifying a preservation planning process for personal collections, just over a week ago (on the day my granddaughter was born!). It included templates for a Scoping Document, a Digital Preservation Plan, A Digital Preservation Chart, and a Preservation Maintenance Plan – which I have produced in the belief that the document formats derived in the course of this work may be of use to others. I am now in the process of trying to find a publisher.

The completion of the preservation planning trials and the associated paper, now puts me in a position to undertake preservation planning on my lifetime collection of work documents. I would prefer to conduct this exercise (which will be a major challenge) in conjunction with the destination repository for the collection; but, in the absence of such a repository, with any other organisations and individuals who might be interested. I am devoting my efforts in the immediate future to identifying such organisations or individuals. Please get in touch if you would like to be involved.

Second Trial Finished

Yesterday I finished my second trial of the Preservation Planning process on the 17,000+ files of our photo collection. While the preservation activities were a little laborious and time-consuming by their very nature, there’s no doubt that the planning process was a whole lot easier the second time around with the benefit of the experience and template documentation from the first trial. The first trial highlighted a need to undertake pre-planning work on file formats, and I duly did this to good effect – that aspect will become embedded in the process  going forward.

I believe I now have enough knowledge to embark on Preservation work on my PAW/DOC document collection (which was the original objective of the two trials) sometime early next year. First, though, I’ll update the documentation and templates to reflect the findings of this second trial. I think I may also write up my findings in the form of a journal paper since, as I discovered when I started this work, such process guidance for owners of personal collections does not seem to be freely available.

Giant Jigsaws

I’ve seen very, very small jigsaws – but no very, very big jigsaws. Wouldn’t they be a good accompaniment to giant chess and giant connect-4 in a garden or park?

Practical OFC

A few weeks ago a discussion with a friend prompted me to try to write down some brief practical guidance for achieving Order from Chaos. The resulting Practical approaches to Order from Chaos summarises into the following steps:

  1. Be clear about what specific categories of stuff you want to deal with.
  2. Figure out why you want to keep each category.
  3. Take the opportunity to reassess what you really want to keep/collect going forward.
  4. Decide where the best place is to store each category.
  5. Be clear about the specific space you are going to use for storage.
  6. Decide how you will set out, equip and organise the storage space.
  7. Do a quick sort of each category into sub-categories.
  8. As you do the quick sort, set aside the stuff you want to throw away or dispose of.
  9. Decide exactly how you will store each sub-category.
  10. Go through each sub-category in detail, organising as required.
  11. Set up the storage space/containers/equipment and store the sorted sub-categories.

Should anyone try to apply these suggestions, I’d be interested to hear how you got on.