The importance of curation in a metadata driven information architecture

Posted on March 6, 2010
Filed Under Semantic Web, information architecture, journalism, semantic information architecture | 5 Comments

How do you retain a sense of editorial voice and craft as information architectures become increasingly metadata driven?

In my work with BBC Journalism we have been attempting to take the philosophy of Tom Scott’s Wildlife Finder and applying it to News and Sport. Our starting point has been the Winter Olympics.

The step change was in creating a populated domain model for the games. The things that made up this vocabulary were used by journalists to tag their stories. The tagged stories were then aggregated automatically onto sports indexes. This allowed us to create many more indexes than would have been possible with manual management.

Overall the project was a great success but it raised some interesting questions. The design of the indexes was created by the user experience team. The algorithms were written by developers and informed the ordering of the stories.  This left journalists to simply tag stories and watch their stories appear on indexes they had no control over. It certainly felt like their influence on part of the product had moved a step away from them. This was reflected in journalists’ feedback and the frequent questions about how to game the system to control the order of stories on indexes.

So the questions are:

  • How do you enable the journalists to feel in control of the story telling?
  • How to do this without introducing tags for value judgements?
  • How do you ensure that the site has voice and feels editorialised – as opposed to being simply lists of dynamically aggregated data?

Tom Scott has convinced me the answer is the concept of the collection (and variations on this theme). The collection replicates the manually managed index of stories with a structured list of things. The Wildlife finder example is David Attenborough’s favourite moments. A very simple example for sport might be the best goals of the World Cup. Although this does not seem particularly radical, the beauty of it is that the curatorial layer is built on top of a domain modelled approach.

Because the things that live in our model are associated with assets and data,  the journalist, in selecting a thing to include in a collection pulls data through the system.

Take the same example of the best goals of the World Cup. A journalist would select their top ten goals of the tournament. As the journalist identifies and pulls things through the system into the collection the context around those goals are pulled with them. So the game they were scored in, the importance it had and information about the goal scorers record in the tournament.

Why it is not tagging:

It is important to distinguish the process of creating a collection from the act of tagging. Tagging associates content with things in the domain model. Journalists tagging stories ensure we build up a consistent mapping of the editorial content to the things (and/or concepts) in our domain.

The process of creating collections is closely tied to the editorial judgement of those curating them. Tagging clips with the tag good goal and then anonymously aggregating them is not.

Why it empowers journalists:

The Guardian has found the balance in their topic pages by allowing an editor to pick a story to be displayed at the top of every automated page. But does this go far enough? This still sits very much within the document model of storytelling. What a collection (or similar) begins to allow is a true web adaptation of a news story.

It is the curatorial layer and the use of collections that will allow organisations to reflect voice, perspective and expertise.  How this will improve the experience for the news reader will be the subject of this blog over the forthcoming months.

Could the means by which news organisations adapt their story telling using tools like collections be the key to their ongoing survival?

Collections part 1: Collections of links

Comments

5 Responses to “The importance of curation in a metadata driven information architecture

  1. Paul Rissen on March 7th, 2010 12:06 pm

    Interesting and useful write-up – and thanks for the link!

    I’d like to push the concept of ‘collection’ further, and say that it’s *exactly* what the role of the journalist should be – it’s the web version of a traditional news story. A journalist writes an article by collecting various sources together and making some informed links between them. Using linked data and ‘collections’, the journalists can do this in a much more literal way, allowing the audience to both ‘read’ the story/collection brought together by the journalist, and explore further to see other connections made by other journalists on the same topics.

  2. Mike Atherton on March 9th, 2010 9:57 am

    @Paul – eventually (and in some cases) perhaps, but that assumes all the journalist’s primary sources are addressable as linked data. It’s hard to see how interviews/classified documents/deep-throat-style clandestine meetings could be exposed as story sources. By all means have footnotes and inline links that show your working for those who want to dig deeper, but generally people don’t want to have to work too damn hard just to read a story. I’d say let journos be journos, and let editors take on the curation of collections.

    But I do think collections are an essential top layer of domain-modelled sites. DM sites are ‘just’ web-facing relational databases, which can make them a bit impenetrable and soulless. Wildlife Finder really only becomes a mass-market proposition through collections that cut through the option paralysis and surface the ‘good’ stuff.

  3. Some thoughts on moving beyond the resource « Derivadow.com on March 11th, 2010 11:00 pm

    [...] unit. Silver Oliver has recently written about why he thinks this approach is important, why curation in a metadata driven information architecture – it’s a very good post — you should read it. But I thought I would share a bit [...]

  4. Paul Rissen on March 12th, 2010 10:23 am

    Collections, and/or some other form of curation, are important for helping the audience find good stuff, but I don’t think it should be *all* we do, because to me it feels like it’s replicating the linear narrative on the web. Granted, that’s not a bad thing, but the power of the web is that we can open up the stories we tell to the audience, so they can explore the raw materials themselves, and understand/enjoy our stories better…

  5. Georgi Kobilarov » Linked Data and Enterprise Data Integration on March 24th, 2010 2:17 pm

    [...] becomes the place where the links between data objects get managed, where data collections get curated, where it gets defined which data sources and pieces of information to trust for which use case, [...]

Switch to our mobile site