Re: [backstage] Geotagging BBC news stories

2005-07-25 Thread Tony Hirst

With the speed with which the annotation stream idea is being worked up, I
can see the time is ripe to revisit the idea of an information mixing desk
(cf an audio mixing desk).

Info feeds come in at the top, stories can get boosted/attenuated in the
presentation order according to eg timeliness, keywords, locale etc etc
(cf bass, treble etc), effects applied to feeds correspond perhaps to
annotation feeds, perhaps to info pulled in from other services (eg if a
story/feed is tagged, then delicious can be mined for related pages, and
perhaps even related feeds  (hmm - does delicious support rss or
webservice linktypes, so you can just search on those?)

But that's by the by...

What I meant to post was:

As I understand it, the annotationStream idea allows a user to provide a
set of annotations that can be applied to a particular original feed. But
what if I want to define a feed that is a combination of several distinct
annotationStreams defined at quite a high level? 

is there any merit in going a step further and having an annotationMix (or
annotationPatch?) (that may or may not be an annotationStream?) that
contains eg one or more useAnnotationStream
annotationURI=URI-for-annotation-service / tags and defines which bits
of those various annotations from each feed should be used in the final
annotationMix?

It wouldn;t go as far as defining an xslt to remix the final feed, but 
would be useful as a config file for a mixer of several feeds, perhaps, or
setting up an XSLT to generate the final annotated/mixed feed?

tony


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


RE: [backstage] Geotagging BBC news stories

2005-07-25 Thread Dr R Brittain
Hello Tony,
Do you knock up prototypes?

Dr. Roger Brittain
The Old Rectory
Brinklow
Warwickshire
CV23 0NE
Great Britain
Telephone (44) 01788 832 660
Mobile 07821 184 074
Email [EMAIL PROTECTED] 

-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Tony Hirst
Sent: 25 July 2005 1:04 PM
To: backstage@lists.bbc.co.uk
Subject: Re: [backstage] Geotagging BBC news stories


With the speed with which the annotation stream idea is being worked up, I
can see the time is ripe to revisit the idea of an information mixing desk
(cf an audio mixing desk).

Info feeds come in at the top, stories can get boosted/attenuated in the
presentation order according to eg timeliness, keywords, locale etc etc
(cf bass, treble etc), effects applied to feeds correspond perhaps to
annotation feeds, perhaps to info pulled in from other services (eg if a
story/feed is tagged, then delicious can be mined for related pages, and
perhaps even related feeds  (hmm - does delicious support rss or
webservice linktypes, so you can just search on those?)

But that's by the by...

What I meant to post was:

As I understand it, the annotationStream idea allows a user to provide a
set of annotations that can be applied to a particular original feed. But
what if I want to define a feed that is a combination of several distinct
annotationStreams defined at quite a high level? 

is there any merit in going a step further and having an annotationMix (or
annotationPatch?) (that may or may not be an annotationStream?) that
contains eg one or more useAnnotationStream
annotationURI=URI-for-annotation-service / tags and defines which bits
of those various annotations from each feed should be used in the final
annotationMix?

It wouldn;t go as far as defining an xslt to remix the final feed, but 
would be useful as a config file for a mixer of several feeds, perhaps, or
setting up an XSLT to generate the final annotated/mixed feed?

tony


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-25 Thread Tony Hirst
backstage@lists.bbc.co.uk on 25 July 2005 at 15:42 + wrote:


Do you knock up prototypes?

Yes - given time (in fact, i only ever do prototypes, and then it's coded
with cannibalised code that hangs together with the software equivalent of
gaffer tape.i don;t think i've ever got anything as far as a late
beta...;-)

however, i'm blagging minutes rather than hours for not-work stuff at the
mo, and the pace of change on this list means that prototypes are
appearing at the speed of thought... if only i could get time to put a bid
in for the info mixing desk as a proper research project...(gazes
longingly into the distance)

[resonates with this from Matt.
I wish I could buy some time to get the annotation streams thing
up and running!
]

re the info mixing desk - I thought i'd blogged about this a bit ago, but
can;t find it anywhere expect for a couple of pages of scribbled notes in
an old logbook, so i'm afraid i can;t link you to more detailed thoughts
just at the mo...

Original idea was just a rap on an rss equivalent of something like this:
http://www.theatrecrafts.com/sound_mixingdesk.html

that was more than eg just this: 
http://www.feedcombine.co.uk/st/content/makefeed/ (which doesn;t appear to
be working at the mo?)

Cf the sounddesk, the line feeds are RSS feeds. Eq allows you to filter
results on keywords, perhaps (eg like +terms and -terms in search
engines), auxiliaries i saw as sending the rss to some sort of feed
processor that returns something pulled from the original feed (tho now i
realise that what the aux's would more usefully do is provide Matt's
annotationStreams...)
 
The different channels/faders, allow you to set the weight of items from
different RSS inputs(with added FX/annotations) in the overall output
infomix. This could be another RSS feed, or it could be data plotted on a
google map, or some other visualisation.

The sounddesk graphic doesn;t have mute/solo buttons, etc, but they'd be
there to allow the infomixer to check individual feeds etc.

I had a scout around some time ago for exemplars (lazy as i am) and about
the only thing i came across at the time that wasn't an RSS aggregator was
this text mixing desk
http://www.lazaruscorporation.co.uk/v4/cutup/textinput.php
which is NOT the sort of thing i was thinking of... but it probably takes
you to interesting places if you think around it a bit...

In terms of FX, it's also quite a fruitful starting point  thinking of
info analogues to sound FX, though they soon breakdown - so for example an
echo on a blog posting might be a quote from, or interpretation of, that
blog on another blog;

Re annotation feeds - another useful 'annotation' might be images or
movies to illustrate a feed (so the info mising desk needs an 'add image'
effect..:-)

tony

---
SEE THE WICKED ROBOT INVASION MAP
http://www.wickedrobots.co.uk/
---
Mail tags:
---
Tony Hirst
mailto:[EMAIL PROTECTED]
blog: http://micro-info.blogspot.com/

Dept. of ICT, Faculty of Technology
Open University, Walton Hall,
Milton Keynes, MK7 6AA, UK

Tel: +44 (0)19086 52789, m./SMS 07709 766223

http://robofesta.open.ac.uk/tony
http://www.robofesta-uk.org



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
Unofficial list archive: http://www.mail-archive.com/backstage@lists.bbc.co.uk/


Re: [backstage] Geotagging BBC news stories

2005-07-23 Thread Matthew Hurst
Here is the blog entry:

http://datamining.typepad.com/data_mining/2005/07/feed_annotation.html

Feed Annotation Streams

With the help of Davy Mitchell, and the input of others on the
backstage discussion list (Tony Hirst, Duncan Barclay, and others) we
have put together the first proof of concept of Feed Annotation
Streams (formerly known as RSS Annotation Streams). The idea is simple
- consumers of a feed create annotations for that feed, the put those
annotations in a new feed that refers to the original items by guid.

The example is based on the BBC RSS news feed and Davy's mood score
(is the news good or bad?) for the articles. The RSS feed is consumed
in DataSphere and geotagged, the items are then referenced against the
mood score FAS and coloured. The colouring I applied is very simple.
If the sum of the mood scores for a location is positive, the pin is
painted green, negative red and zero white.

There is a lot still to do in terms of defining the spec, but it looks
like it has great potential.

Fas

The Feed Annotation Stream looks something like:

annotationStream feedURL=original-feed-url
annotationURI=URI-for-annotation-service feedRead=date
annotationStreamWrite=date
  annotation guid=item-guid
!--annotation information here--
  /annotation
/annotationStream
On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
 Ok - here is the very first instance of using a Feed Annotation Stream
 together with a feed from the bbc to produce geolocated, mood mapped
 news. The image shows a sample of data from a recent grab of rss from
 the beeb mixed with reading Davy's mood annotation
 feed. The pins indicate the volume of posts and the colours indicate
 the sum of the mood scores for that locatoin was negative (red) and
 positive (green).
 
 Phew!
 
 Having done this (with throw away code) I have some comments:
 
 1) guid - needs to be the guid from the rss feed, not the (perma) link
 to the story.
 Davy - I munged the urls you had in there to provide actual guids.
 2) unique? there were some duplicates, but i guess this doesn't matter.
 3) Feed Annotation Stream is a better name as there is no reason it
 has to be for
 rss - anything with an identifiable guid would do.
 
 I'm probably going to blog this later and repeat the post to this
 list. I'd also like to
 put up a complimentary FAS for this data with the geolocation stuff in
 it to give a complete
 proof of concept.
 
 Davy - great stuff here - let me know what you would like to do next.
 I am working towards getting my stuff visible outside the company -
 these wheels move a little slower as there are some purchases
 involved. I would like to get more work done on the spec and to host
 it off my blog somehow. Please throw any comments up on this list!
 
 MattH
 
 On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
  More thoughts: the annotation spec should include information about when the
  rss feed was crawled and also when the annotation stream was created.
 
  I have some data now (both your annotation and some rss files with
  matching guids) saved to disc, I'll post the results sometime this
  weekend.
 
  MattH
 
  On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
   This is great. I will get something together asap.
  
   Thanks!
  
   Matt
  
   On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:
On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
 I'm still for something like the original file format I suggested
 earlier. Simplicity is very attractive in this type of endevour.
   
:-)
   
http://www.latedecember.com/sites/moodnews/moodnewsanno.xml
   
Its one big xml of all the feeds as I don't store the feed URL in the
db (yet!). Should be enough for a demo... Email me if you need
anything changed etc.
   
Interesting stuff!!
   
Mood News development trundles on with a mix of back end and client
stuff being added. It has been slowed by me hunting for the perfect
Python IDE. A new version next week perhaps?
   
Thanks,
   
Davy Mitchell
   
http://www.latedecember.com
   
   
-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, 
please visit 
http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
   
  
 
 
 



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-23 Thread Davy Mitchell
On 7/23/05, Matthew Hurst [EMAIL PROTECTED] wrote:
 Here is the blog entry:
 
 http://datamining.typepad.com/data_mining/2005/07/feed_annotation.html
 
 Feed Annotation Streams

Looks good - but I might have to hack the Python RSS library I am
using to get the GUID tag out of it :-) I'll have a go at updating the
XML with more data such as supplying the colour data used on the web
page instead of the raw score.

If 2 or 3 of these feeds are combined. well the mind boggles!

Off bug squishing...

See you,
Davy

-- 

Davy Mitchell

http://www.latedecember.com


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-23 Thread Matthew Hurst
Davy - yes, I think there is a real opportunity here for something.
I'm imagining a
ping server for FAS data. There is open source java code for a ping
server, I bet we could trivially mod it to work for FAS data. If we
could put this up and provide a front end (very very simple thing,
just a list of all the annotation streams that have updated in the
last n minutes)...

What do you think? Do you have any server resources that we could use?
If not, I'm pretty sure I can get my hands on something.

BTW, note that the annotationURI i the annotationStream tag is meant
to point to a specification for the annotations in the annotation
node, not a pointer to the actual
file. In otherwords, you need to have a mood annotation specification
somewhere (e.g. stating that negative means bad, positive means good
and 0 means neutral). The way
URIs work is that they don't have to actually resolve to something,
they just have to *mean* something. However, it is far more convenient
if the URI points to an actual document that states the spec.

I will work on something more formal as a definition.

MattH

On 7/23/05, Davy Mitchell [EMAIL PROTECTED] wrote:
 On 7/23/05, Matthew Hurst [EMAIL PROTECTED] wrote:
  Here is the blog entry:
 
  http://datamining.typepad.com/data_mining/2005/07/feed_annotation.html
 
  Feed Annotation Streams
 
 Looks good - but I might have to hack the Python RSS library I am
 using to get the GUID tag out of it :-) I'll have a go at updating the
 XML with more data such as supplying the colour data used on the web
 page instead of the raw score.
 
 If 2 or 3 of these feeds are combined. well the mind boggles!
 
 Off bug squishing...
 
 See you,
 Davy
 
 --
 
 Davy Mitchell
 
 http://www.latedecember.com
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
 visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-23 Thread Peter Williams




Hi

a couple of thoughts

The use of Geotagging could be more useful if the tags were split
according to the location of the author, location of the central story
elements and the location where the story was published. For example, a
story about Mugabe (location tag Zimbabwae), is filed
from South Africa (location Tag Joberg), and "published"
in the UK (location tag London/UK)

In this way it would be possible to group stories by perspective, i,e
the Australian press version of the London bombings, or the UK
perspective on Mugabe as well as grouping by location of the stories,
and their authors. This may link in well with any catagorization
approaches to news feeds.

A look up server that contained well known locations expressed in
Lat/Long would make the application of the tag a simple process. If
extended to the annotators, then one could the see the location based
perspective of the commentator. 


Regards


Peter

(located in Australia- somewhere in the south pacific) 





Matthew Hurst wrote:

  More thoughts: the annotation spec should include information about when the
rss feed was crawled and also when the annotation stream was created.

I have some data now (both your annotation and some rss files with
matching guids) saved to disc, I'll post the results sometime this
weekend.

MattH

On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
  
  
This is great. I will get something together asap.

Thanks!

Matt

On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:


  On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
  
  
I'm still for something like the original file format I suggested
earlier. Simplicity is very attractive in this type of endevour.

  
  :-)

http://www.latedecember.com/sites/moodnews/moodnewsanno.xml

Its one big xml of all the feeds as I don't store the feed URL in the
db (yet!). Should be enough for a demo... Email me if you need
anything changed etc.

Interesting stuff!!

Mood News development trundles on with a mix of back end and client
stuff being added. It has been slowed by me hunting for the perfect
Python IDE. A new version next week perhaps?

Thanks,

Davy Mitchell

http://www.latedecember.com


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.

  

  
  

-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


  


-- 
Peter Williams
CEO
Eurofin Technologies

www.eurofin.net

T +612 9968 1240
F +612 9968 1552




Re: [backstage] Geotagging BBC news stories

2005-07-23 Thread Matthew Hurst
I've been thinking about servers wrt annotations. Why wouldn't you
just have a server that takes a query with a guid and produces an
annotation? I thin kthere is a really good reason for not going for
this at the first cut: simplicity. If everyone out htere who is
providing meta data for rss feeds has to also provide a server, then
that is a high barrier to entry. If all they have to do is run a web
server and ping a ping service, that is easier. It would then be
trivial to create a server that reads these annotations and acts as a
clearing house with as much query based service as possible.

Peter, your comments about location are spot on. In fact I tag all my
rss feeds, where possible with locatoin data (where the feeds are
published). I know that serious feed based services (like
moreover.com) tag feeds with a huge number of meta data (40?). Having
a server that, given a location, produces coords - this makes sense
for fixed meta data, but then, if this is fixed, it would be
preferable to only look the infor up once when entering the feed into
a database, not to do it every time an article from the feed is
published.

I think one of the things you are getting it, which sounds really
cool, is something that would be able to say 'here is how people in
turkey view the bombings in london'.

Perhaps you are extending the annotation idea (which is item oriented)
to feeds. In
other words, annotations for a feed (including the information about
where the feed
is from) could be delivered by a lookup service? 

Matt

On 7/23/05, Peter Williams [EMAIL PROTECTED] wrote:
  Hi
  
  a couple of thoughts
  
  The use of Geotagging could be more useful if the tags were split according
 to the location of the author, location of the central story elements and
 the location where the story was published. For example, a story about
 Mugabe (location tag Zimbabwae), is filed from South Africa (location Tag
 Joberg), and published in the UK (location tag London/UK)
  
  In this way it would be possible to group stories by perspective, i,e the
 Australian press version of the London bombings, or the UK perspective on
 Mugabe as well as grouping by location of the stories, and their authors.
 This may link in well with any catagorization approaches to news feeds.
  
  A look up server that contained well known locations expressed in Lat/Long
 would make the application of the tag a simple process. If extended to the
 annotators, then one could the see the location based perspective of the
 commentator. 
  
  
  Regards
  
  
  Peter
  
  (located in Australia- somewhere in the south pacific) 
 
  
  
  
  
  
  Matthew Hurst wrote: 
  More thoughts: the annotation spec should include information about when
 the
 rss feed was crawled and also when the annotation stream was created.
 
 I have some data now (both your annotation and some rss files with
 matching guids) saved to disc, I'll post the results sometime this
 weekend.
 
 MattH
 
 On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
  
  
  This is great. I will get something together asap.
 
 Thanks!
 
 Matt
 
 On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:
  
  
  On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
  
  
  I'm still for something like the original file format I suggested
 earlier. Simplicity is very attractive in this type of endevour.
  
  :-)
 
 http://www.latedecember.com/sites/moodnews/moodnewsanno.xml
 
 Its one big xml of all the feeds as I don't store the feed URL in the
 db (yet!). Should be enough for a demo... Email me if you need
 anything changed etc.
 
 Interesting stuff!!
 
 Mood News development trundles on with a mix of back end and client
 stuff being added. It has been slowed by me hunting for the perfect
 Python IDE. A new version next week perhaps?
 
 Thanks,
 
 Davy Mitchell
 
 http://www.latedecember.com
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group. To unsubscribe, please
 visit
 http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
 
  
  
 -
 Sent via the backstage.bbc.co.uk discussion group. To unsubscribe, please
 visit
 http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
 
 
  
  
  -- 
 Peter Williams
 CEO
 Eurofin Technologies
 
 www.eurofin.net
 
 T +612 9968 1240
 F +612 9968 1552



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-22 Thread Davy Mitchell
On 7/21/05, Tony Hirst [EMAIL PROTECTED] wrote:
 backstage@lists.bbc.co.uk on 21 July 2005 at 21:52 + wrote:
  it seems you have the same motivation as I had when I mailed this list
 about RSS Annotation Streams!

The whole idea sounds great. I have plans to expose some of the Mood
News data but never thought of anything quite so detailed. How about
an XML with the URL as GUID plus the data (rating, colour etc) ? Would
it need a basic schema structure?

Could lead to some very interesting client applications!

Thanks,
Davy

-- 

Davy Mitchell

http://www.latedecember.com


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-22 Thread Ben O'Neill
Is it against the TCs to provide an RSS file that is 99% the same as
the one on the BBC but with geographical information added?

On 21/07/05, Tony Hirst [EMAIL PROTECTED] wrote:
 backstage@lists.bbc.co.uk on 21 July 2005 at 21:52 + wrote:
  it seems you have the same motivation as I had when I mailed this list
 about RSS Annotation Streams!
 
 missed that - ah:
 http://www.mail-archive.com/backstage@lists.bbc.co.uk/msg00330.html and
 http://datamining.typepad.com/data_mining/2005/07/rss_annotation_.html
 
 I agree with the sentiment, and then some - if someone goes to the trouble
 of adding value to a story/feed, it would be good if those extras could be
 syndicated separately but identifiably too without others having to
 reinvent/reimplement the same technique/recreate the same annotation data.
 
 Perhaps this is format that we want to develop
 further?
 
 That sounds like an interesting (and perhaps useful) exercise...where to
 start? i tend to work back to generalised solutions from a couple of
 similar but different implementations of a thing...would it be fair to say
 that the annotation stream on its own would be relatively worthless
 without the original story it was annotating? or could you imageine it
 ever standing alone on its own terms?
 
 
 Back to the geotagged BBC stories, even something as simple as returning
 lat/long when passed a story ID would be v reusable...
 
 For example:
 http://backstage.bbc.co.uk/prototypes/archives/2005/05/a_map_of_the_ne.html
  http://boneill.ninjagrapefruit.com/wp-content/bbc/newmaps/ divines geo
 data
 
 and  Duncan's http://backstage.min-data.co.uk/geotagged/  must have a
 script for locating stories
 So if someone can make story id/geotag info available, then I'm sure the
 lazy community (of which I count myself a part) would be v grateful :-)
 
 tony
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
 visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
 


-- 
Ben O'Neill
[EMAIL PROTECTED]


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-22 Thread Matthew Hurst
I think the separation of the original data from the annotation is attractive.
Yes, the annotation would be almost meaningless without the original data, but
it would save the annotation owner from republishing the original content and
any considerations of legality, etc. In addition, it would drive
traffic to the source
of the original content (always a good thing).

There are (at least) two possible approaches: a query approach (give
me your annotation for
this guid) and a syndicated approach. The syndicated approach is less
of a resource strain as the provider just has to deliver something
very similar to rss, they don't have to have a query service running,
just a document server (web server).

In terms of developing the idea, we could certainly start with getting
feedback in this forum and even have some implementations provide a
service (Davy?). But I'd also like to hear any advice about how we can
rapidly formalize this and get the idea out there along with a
proposal for an api/document format. If only we had that wiki we were
all so excited about up and running...

As the format is simple (guid mapping to some bundle of information),
I guess we are thinking of something like:

..
annotationStream feedURL= bbc feed that is being annotated
annotationURI=http://www.latedecember.com/newsMood.xml;.
annotation guid=...
!-- put your annotation here--
/annotation
...more of above...
/annotationStream

Note that the stream points to both the rss feed and the annotation
information (which ought to describe the syntax and semantics of the
element at !--put your annotation here--)

Davy, if you put a file up like this, I could easily integrate it into
my data sphere thing and put up some screen shots -  we could claim
that constitutes the first implementation!

Matt

http://datamining.typepad.com

On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:
 On 7/21/05, Tony Hirst [EMAIL PROTECTED] wrote:
  backstage@lists.bbc.co.uk on 21 July 2005 at 21:52 + wrote:
   it seems you have the same motivation as I had when I mailed this list
  about RSS Annotation Streams!
 
 The whole idea sounds great. I have plans to expose some of the Mood
 News data but never thought of anything quite so detailed. How about
 an XML with the URL as GUID plus the data (rating, colour etc) ? Would
 it need a basic schema structure?
 
 Could lead to some very interesting client applications!
 
 Thanks,
 Davy
 
 --
 
 Davy Mitchell
 
 http://www.latedecember.com
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
 visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


RE: [backstage] Geotagging BBC news stories

2005-07-22 Thread Joel Chippindale
This sounds like it could be built on top of one of a social bookmarking
sites.

Then you could post your (geo)tags of BBC News stories to your social
bookmarks account using the site's API...and others could use the same
API to interrogate the site for the tags that have been associated with
a particular BBC News story by asking for all the tags associated with
the story's URL.

It doesn't look like this would be possible using del.ico.us because
it's API http://del.icio.us/doc/api does not enable you to request all
the tags/extended descriptions used for a particular URL. However
perhaps one of the open source implementations e.g. http://de.lirio.us/
or http://sourceforge.net/projects/scuttle/ could be modified to enable
this?

Joel


-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Matthew Hurst
Sent: 22 July 2005 10:09
To: backstage@lists.bbc.co.uk
Subject: Re: [backstage] Geotagging BBC news stories


I think the separation of the original data from the annotation is
attractive. Yes, the annotation would be almost meaningless without the
original data, but it would save the annotation owner from republishing
the original content and any considerations of legality, etc. In
addition, it would drive traffic to the source of the original content
(always a good thing).

There are (at least) two possible approaches: a query approach (give me
your annotation for this guid) and a syndicated approach. The syndicated
approach is less of a resource strain as the provider just has to
deliver something very similar to rss, they don't have to have a query
service running, just a document server (web server).

In terms of developing the idea, we could certainly start with getting
feedback in this forum and even have some implementations provide a
service (Davy?). But I'd also like to hear any advice about how we can
rapidly formalize this and get the idea out there along with a proposal
for an api/document format. If only we had that wiki we were all so
excited about up and running...

As the format is simple (guid mapping to some bundle of information), I
guess we are thinking of something like:

..
annotationStream feedURL= bbc feed that is being annotated
annotationURI=http://www.latedecember.com/newsMood.xml;.
annotation guid=...
!-- put your annotation here--
/annotation
...more of above...
/annotationStream

Note that the stream points to both the rss feed and the annotation
information (which ought to describe the syntax and semantics of the
element at !--put your annotation here--)

Davy, if you put a file up like this, I could easily integrate it into
my data sphere thing and put up some screen shots -  we could claim that
constitutes the first implementation!

Matt

http://datamining.typepad.com

On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:
 On 7/21/05, Tony Hirst [EMAIL PROTECTED] wrote:
  backstage@lists.bbc.co.uk on 21 July 2005 at 21:52 + wrote:
   it seems you have the same motivation as I had when I mailed this 
  list about RSS Annotation Streams!
 
 The whole idea sounds great. I have plans to expose some of the Mood 
 News data but never thought of anything quite so detailed. How about 
 an XML with the URL as GUID plus the data (rating, colour etc) ? Would

 it need a basic schema structure?
 
 Could lead to some very interesting client applications!
 
 Thanks,
 Davy
 
 --
 
 Davy Mitchell
 
 http://www.latedecember.com
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, 
 please visit 
 http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe,
please visit
http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-22 Thread Matthew Hurst
(reposting)

I think the volume of data that is potentially in this would not be suitable
for a social bookmarking system. Imagine geocoding every news article in
every news rss feed out there (say 10, 000 feeds).

As for the redistribution of the rss content, the separation has one
huge benefit.
It means that a consumer could pick multiple annotation streams for a single rss
feed. For example, I could listen t obbc rss, a geotagging stream and a mood
stream and put them together (that is precisely what I want to do!). If the
geotagger included geotagging and the rss but not the mood, and
similarly the mood
stream had the rss data, mood but not geo data, I would still have to
do the alignment
of differnet data. By making the separation part of the specificatoin,
the whole thing
is a lot cleaner and extensible - it all boils down to correct usage of guid.

I'm still for something like the original file format I suggested
earlier. Simplicity
is very attractive in this type of endevour.

Matt

On 7/22/05, Joel Chippindale [EMAIL PROTECTED] wrote:
 This sounds like it could be built on top of one of a social bookmarking
 sites.
 
 Then you could post your (geo)tags of BBC News stories to your social
 bookmarks account using the site's API...and others could use the same
 API to interrogate the site for the tags that have been associated with
 a particular BBC News story by asking for all the tags associated with
 the story's URL.
 
 It doesn't look like this would be possible using del.ico.us because
 it's API http://del.icio.us/doc/api does not enable you to request all
 the tags/extended descriptions used for a particular URL. However
 perhaps one of the open source implementations e.g. http://de.lirio.us/
 or http://sourceforge.net/projects/scuttle/ could be modified to enable
 this?
 
 Joel
 
 
 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of Matthew Hurst
 Sent: 22 July 2005 10:09
 To: backstage@lists.bbc.co.uk
 Subject: Re: [backstage] Geotagging BBC news stories
 
 
 I think the separation of the original data from the annotation is
 attractive. Yes, the annotation would be almost meaningless without the
 original data, but it would save the annotation owner from republishing
 the original content and any considerations of legality, etc. In
 addition, it would drive traffic to the source of the original content
 (always a good thing).
 
 There are (at least) two possible approaches: a query approach (give me
 your annotation for this guid) and a syndicated approach. The syndicated
 approach is less of a resource strain as the provider just has to
 deliver something very similar to rss, they don't have to have a query
 service running, just a document server (web server).
 
 In terms of developing the idea, we could certainly start with getting
 feedback in this forum and even have some implementations provide a
 service (Davy?). But I'd also like to hear any advice about how we can
 rapidly formalize this and get the idea out there along with a proposal
 for an api/document format. If only we had that wiki we were all so
 excited about up and running...
 
 As the format is simple (guid mapping to some bundle of information), I
 guess we are thinking of something like:
 
 ..
 annotationStream feedURL= bbc feed that is being annotated
 annotationURI=http://www.latedecember.com/newsMood.xml;.
 annotation guid=...
 !-- put your annotation here--
 /annotation
 ...more of above...
 /annotationStream
 
 Note that the stream points to both the rss feed and the annotation
 information (which ought to describe the syntax and semantics of the
 element at !--put your annotation here--)
 
 Davy, if you put a file up like this, I could easily integrate it into
 my data sphere thing and put up some screen shots -  we could claim that
 constitutes the first implementation!
 
 Matt
 
 http://datamining.typepad.com
 
 On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:
  On 7/21/05, Tony Hirst [EMAIL PROTECTED] wrote:
   backstage@lists.bbc.co.uk on 21 July 2005 at 21:52 + wrote:
it seems you have the same motivation as I had when I mailed this
   list about RSS Annotation Streams!
 
  The whole idea sounds great. I have plans to expose some of the Mood
  News data but never thought of anything quite so detailed. How about
  an XML with the URL as GUID plus the data (rating, colour etc) ? Would
 
  it need a basic schema structure?
 
  Could lead to some very interesting client applications!
 
  Thanks,
  Davy
 
  --
 
  Davy Mitchell
 
  http://www.latedecember.com
 
 
  -
  Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe,
  please visit
  http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
 
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe,
 please visit
 http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
 visit http

Re: [backstage] Geotagging BBC news stories

2005-07-22 Thread Matthew Hurst
This is great. I will get something together asap.

Thanks!

Matt

On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:
 On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
  I'm still for something like the original file format I suggested
  earlier. Simplicity is very attractive in this type of endevour.
 
 :-)
 
 http://www.latedecember.com/sites/moodnews/moodnewsanno.xml
 
 Its one big xml of all the feeds as I don't store the feed URL in the
 db (yet!). Should be enough for a demo... Email me if you need
 anything changed etc.
 
 Interesting stuff!!
 
 Mood News development trundles on with a mix of back end and client
 stuff being added. It has been slowed by me hunting for the perfect
 Python IDE. A new version next week perhaps?
 
 Thanks,
 
 Davy Mitchell
 
 http://www.latedecember.com
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
 visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-22 Thread Matthew Hurst
More thoughts: the annotation spec should include information about when the
rss feed was crawled and also when the annotation stream was created.

I have some data now (both your annotation and some rss files with
matching guids) saved to disc, I'll post the results sometime this
weekend.

MattH

On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
 This is great. I will get something together asap.
 
 Thanks!
 
 Matt
 
 On 7/22/05, Davy Mitchell [EMAIL PROTECTED] wrote:
  On 7/22/05, Matthew Hurst [EMAIL PROTECTED] wrote:
   I'm still for something like the original file format I suggested
   earlier. Simplicity is very attractive in this type of endevour.
 
  :-)
 
  http://www.latedecember.com/sites/moodnews/moodnewsanno.xml
 
  Its one big xml of all the feeds as I don't store the feed URL in the
  db (yet!). Should be enough for a demo... Email me if you need
  anything changed etc.
 
  Interesting stuff!!
 
  Mood News development trundles on with a mix of back end and client
  stuff being added. It has been slowed by me hunting for the perfect
  Python IDE. A new version next week perhaps?
 
  Thanks,
 
  Davy Mitchell
 
  http://www.latedecember.com
 
 
  -
  Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
  visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.
 



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-21 Thread Duncan Barclay




Hi,

I could output geotagged news stories in xml if you like, but it would
mean setting something up to actually work out where they are on a
regular basis (at the moment the script I made to do it is only
manual), although that is planned for another prototype I am in the
middle of making.. Some data from ages ago is plotted at
http://backstage.min-data.co.uk/geotagged/ .

I don't think Yahoo's mapping service will be widely used as it is
nowhere near as nice as Google's equivalent. It can't be long to the
Microsoft one is release (at least in the US), which looks like it
might be quite nice.

I can probably get stuff up sometime after Open Tech (which I plan to
go to by the way, assuming there aren't too many underground lines
closed).

Duncan


Tony Hirst wrote:

  Several sites are plotting news stories on Gmaps I beleive (anyone using
yahoomaps yet?) and as a result generating geo info for news stories. Lazy
as I am, is anyone syndicating this geo info indexed/searchable by BBC
news story ID?

thanks
tony
---
SEE THE WICKED ROBOT INVASION MAP
http://www.wickedrobots.co.uk/
---
Mail tags:
---
Tony Hirst
mailto:[EMAIL PROTECTED]
blog: http://micro-info.blogspot.com/

Dept. of ICT, Faculty of Technology
Open University, Walton Hall,
Milton Keynes, MK7 6AA, UK

Tel: +44 (0)19086 52789, m./SMS 07709 766223

http://robofesta.open.ac.uk/tony
http://www.robofesta-uk.org



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.

  





Re: [backstage] Geotagging BBC news stories

2005-07-21 Thread Matthew Hurst
Tony, it seems you have the same motivation as I had when I mailed this list
about RSS Annotation Streams! Perhaps this is format that we want to develop
further?

Matt

On 7/21/05, Tony Hirst [EMAIL PROTECTED] wrote:
 
 Several sites are plotting news stories on Gmaps I beleive (anyone using
 yahoomaps yet?) and as a result generating geo info for news stories. Lazy
 as I am, is anyone syndicating this geo info indexed/searchable by BBC
 news story ID?
 
 thanks
 tony
 ---
 SEE THE WICKED ROBOT INVASION MAP
 http://www.wickedrobots.co.uk/
 ---
 Mail tags:
 ---
 Tony Hirst
 mailto:[EMAIL PROTECTED]
 blog: http://micro-info.blogspot.com/
 
 Dept. of ICT, Faculty of Technology
 Open University, Walton Hall,
 Milton Keynes, MK7 6AA, UK
 
 Tel: +44 (0)19086 52789, m./SMS 07709 766223
 
 http://robofesta.open.ac.uk/tony
 http://www.robofesta-uk.org
 
 
 
 -
 Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
 visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.



-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.


Re: [backstage] Geotagging BBC news stories

2005-07-21 Thread Tony Hirst
backstage@lists.bbc.co.uk on 21 July 2005 at 21:52 + wrote:
 it seems you have the same motivation as I had when I mailed this list
about RSS Annotation Streams! 

missed that - ah:
http://www.mail-archive.com/backstage@lists.bbc.co.uk/msg00330.html and
http://datamining.typepad.com/data_mining/2005/07/rss_annotation_.html

I agree with the sentiment, and then some - if someone goes to the trouble
of adding value to a story/feed, it would be good if those extras could be
syndicated separately but identifiably too without others having to
reinvent/reimplement the same technique/recreate the same annotation data.

Perhaps this is format that we want to develop
further?

That sounds like an interesting (and perhaps useful) exercise...where to
start? i tend to work back to generalised solutions from a couple of
similar but different implementations of a thing...would it be fair to say
that the annotation stream on its own would be relatively worthless
without the original story it was annotating? or could you imageine it
ever standing alone on its own terms?


Back to the geotagged BBC stories, even something as simple as returning
lat/long when passed a story ID would be v reusable... 

For example:
http://backstage.bbc.co.uk/prototypes/archives/2005/05/a_map_of_the_ne.html
 http://boneill.ninjagrapefruit.com/wp-content/bbc/newmaps/ divines geo
data

and  Duncan's http://backstage.min-data.co.uk/geotagged/  must have a
script for locating stories 
So if someone can make story id/geotag info available, then I'm sure the
lazy community (of which I count myself a part) would be v grateful :-)

tony


-
Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.