Re: Use-case detail

Eric Miller Wed, 15 Mar 2006 10:44:01 -0800


On Mar 15, 2006, at 12:49 PM, Brian Osborne wrote:

Eric et al.,
Working on writing up some use cases. Chembank is a nice compounddatabase
for demonstration purposes since it associates some fraction of its
compounds with MeSH Diseases terms (
http://chembank.broad.harvard.edu/chemistry/search/input/ontology.htm), it
refers to this ontology as Therapeutic Indication. They also use GO
Biological Process.
A year or so ago you could could access its pages by GET, now itlooks likeit's doing a POST - is this a problem for our programmers? Nodescription of
any API, as far as I can see.

POST only access and no API certainly makes it more difficult toreuse any of this data :(

Regarding when to use GET vs POST, I've found the following resourceuseful...

[[

An important principle of Web architecture is that all importantresources be identifiable by URI. The finding discusses therelationship between the URI addressability of a resource and thechoice between HTTP GET and POST methods with HTTP URIs. HTTP GETpromotes URI addressability so, designers should adopt it for safeoperations such as simple queries. POST is appropriate for othertypes of applications where a user request has the potential tochange the state of the resource (or of related resources). Thefinding explains how to choose between HTTP GET and POST for anapplication taking into account architectural, security, andpractical considerations.

]]
-- http://www.w3.org/2001/tag/doc/whenToUseGet.html

A bit of browsing around looks like there are at least some GETableresources so there might be some data one could gleen


e.g.

http://chembank.broad.harvard.edu/chemistry/search/input/moleculeName.htm

search on '*sulfide*' and then hit 'search' to add Substructure. thisyeilds for example the following search result


disulfiram / ChemBankID: 2038
- http://chembank.broad.harvard.edu/chemistry/viewMolecule.htm?cbid=2038

which points to "find similar molecules"

- http://chembank.broad.harvard.edu/chemistry/findSimilarMolecules.htm?cbid=2038

The system seems session based, but at least parts of the data seemscrapeable.

As you seem to be exploring more the Piggy-bank scraper idea (per thesimile general list), the Open World cat scraper [1] is an example ofa session-based, muti-page scraper than could be adapted to at leastparts of the data on this site.


[1] http://potlach.org/2005/10/scrapers/

--
eric miller                              http://www.w3.org/people/em/
semantic web activity lead               http://www.w3.org/2001/sw/
w3c world wide web consortium            http://www.w3.org/

Re: Use-case detail

Reply via email to