[ 
https://issues.apache.org/jira/browse/CASSANDRA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849382#action_12849382
 ] 

Ben Standefer edited comment on CASSANDRA-873 at 3/24/10 7:53 PM:
------------------------------------------------------------------

A good idea brought up by Edward Capriolo (not a student, so students feel free 
to run with this one) is a Splunk knock-off.  Splunk is software that indexes 
logs (syslog, Apache logs, app logs, whatever) in lots of different ways and 
makes your logs highly searchable and filter-able via a front-end web 
interface.  http://www.splunk.com/product.

While the Splunk product is powerful and awesome, the licensing is not (they 
license by usage instead of per-seat).

I think a Splunk knock-off would be a good demo app for people just getting 
into Cassandra because parsing logs is an easy concept to understand and it 
could start off very simple.  There is a lot of opportunity to utilize all 
features of the Cassandra API (range queries, search indexes, property-specific 
indexes).  This could be made very high scalable by utilizing Scribed 
(http://github.com/facebook/scribe), a scalable logging solution that many 
Cassandra users are already using to store their logs.  It's like rsyslogd on 
crack.

Think Facebook's classic "Inbox Search" for 400M users, but applied to log data.

      was (Author: aguynamedben):
    A good idea brought up by Edward Capriolo (not a student, so students feel 
free to run with this one) is a Splunk knock-off.  Splunk is software that 
indexes logs (syslog, Apache logs, app logs, whatever) in lots of different 
ways and makes your logs highly searchable and filter-able via a front-end web 
interface.  http://www.splunk.com/product.

While the Splunk product is powerful and awesome, the licensing is not (they 
license by usage instead of per-seat).

I think a Splunk knock-off would be a good demo app for people just getting 
into Cassandra because parsing logs is an easy concept to understand and it 
could start off very simple.  There is a lot of opportunity to utilize all 
features of the Cassandra API (range queries, search indexes, property-specific 
indexes).  This could be made very high scalable by utilizing Scribed 
(http://highscalability.com/blog/2008/11/24/product-scribe-facebooks-scalable-logging-system.html),
 a scalable logging solution that many Cassandra users are already using to 
store their logs.  It's like rsyslogd on crack.

Think Facebook's classic "Inbox Search" for 400M users, but applied to log data.
  
> Create a Cassandra demo application
> -----------------------------------
>
>                 Key: CASSANDRA-873
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-873
>             Project: Cassandra
>          Issue Type: Task
>            Reporter: Jonathan Ellis
>            Priority: Minor
>
> http://twissandra.com/ is a demo Cassandra application built on django + 
> pycassa.  It's a great Cassandra showcase and very useful for people learning 
> Cassandra.  We could use more of those.
> Jake Luciani suggested one that presents full-text search of Wikipedia using 
> Lucandra (see 
> http://blog.sematext.com/2010/02/09/lucandra-a-cassandra-based-lucene-backend/
>  and http://github.com/tjake/Lucandra).  Feel free to propose other 
> application ideas here.
> Rackspace is willing to provide a VM to deploy on for a live demo, but 
> remember, to be really useful this needs full DIY instructions, the final 
> product is not the demo but the code + instructions.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to