Collections, datasets and compound objects

A lot of the terminology in this can be highly confusing. Here’s a quick glossary

  • Object –
  1. An unique (to the name space of the work space) identifier
  2. Metadata
    • Name
    • URN
  3. Bitstream that stores a digital representation of an artifact or text or metadata instance
  • Compound object – basically the same as an object except that it contains multiple representaions of an object, eg a tiff image of a manuscript page and a tei encoded representation of the text
  • Dataset – what scientists and social scientists call an object. Basically a set of ordered readings plus metadata defining the columns of the readings
  • Collection – a group of objects collected together on a thematic basis. This will vary by discipline but examples could be date and time of data capture, geographic location of data capture, etc

Datasets are usually simple objects rather than compound objects – this is due to them being generated from a single instrument. It is possible to imagine a scenario in which two different analyses of the data are stored together.

About dgm

Former IT professional, previously a digital archiving and repository person, ex research psychologist, blogger, twitterer, and amateur classical medieval and nineteenth century historian ...
This entry was posted in Uncategorized. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s