[jira] [Commented] (SOLR-13105) A visual guide to Solr Math Expressions and Streaming Expressions

Cassandra Targett (Jira) Fri, 08 Jan 2021 13:26:07 -0800


    [ 
https://issues.apache.org/jira/browse/SOLR-13105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17261590#comment-17261590
 ]


Cassandra Targett commented on SOLR-13105:
------------------------------------------

I put all the comments together to respond to them.

Re: 1, 5, 7, 11, 12, 14, 15 - I don't know where the datasets came from so I 
feel like that shouldn't hold up merging this. They can be added later.

1. enron_emails emails collection, wish we could follow along with that dataset.
2. math-start.adoc line 105 add link to Zeppelin? It's not the most well known 
project!
-- I don't know why this needs a 2nd opinion. Why would we ever not link 
somewhere if it's helpful for the reader? 

3. check math-start.adoc line 121 the parameters!
-- ?? line 121 is a section heading. If you know what's wrong, go ahead and fix 
it?

4. Any thoughts on putting all the various .adoc files in a sub dir? So many of 
them! So we know `loading.adoc` isn't a generic Solr loading help page!
-- Answered in previous comment

5. loading.adoc - wish we had the iris.csv data file as well to follow along, 
plus to easily test the queries!
6. should all field names be wrapped with stars? I.e `filesize_d` --> 
`filesize_d`
-- I'm fixing some/most of these as I go along but it's really tedious so no 
reason I have to be the only person who does so. Whenever something's wrong, 
fix it.

7. search-sample.adoc, line 177, and would like the nyc311 dataset as well!
8. variables.adoc, line 206 appears awkward. Strike the "can be cached in-memory 
for future use."
-- So, you can fix it?

9. variables.adoc, sometimes a variable is wrapped in both code comments and 
bold, and sometimes not. In other places, we appear to just wrap them in bold 
tags. Also in term-vectors.adoc
-- It's all over the place, not just these files. I'm fixing some as I catch 
them, but it's tedious as hell so how about you also go ahead and fix some.

10. matrix-math.adoc line 127, starting here, we introduce some visualizations 
in Apache Zeppelin that are not out-of-the-box Zeppelin ones. Would be nice to 
know how to recreate this visualization! How do I add the heat map? Maybe a 
separate call out to "How I set up Zeppelin" would be useful.
-- A separate page for Zeppelin is a good idea. Again, I don't think it needs 
to hold this up specifically, so go ahead if you'd like or I might eventually 
get to it.

11. term-vectors.adoc We introduce some movie reviews. Again, would love to 
have the sample data!
12. statistics.adoc We introduce the logs collection (I think it's sourced from 
Solr), again, would love to have a call out on how to set it up myself!

13. In statistics.adoc, we refer to stock ticker symbols. Should they be 
all caps? Isn't amzn always AMZN???
-- Yeah, but sometimes it depends a bit on context - if putting it in caps 
would break an example for some reason lower case would be correct. Needs some 
more review.

14. statistics.adoc introduces a stock prices collection. Guess what I'd like 
15. machine-learning.adoc introduces a wine collection. Yep, I am a broken 
record 

> A visual guide to Solr Math Expressions and Streaming Expressions
> -----------------------------------------------------------------
>
>                 Key: SOLR-13105
>                 URL: https://issues.apache.org/jira/browse/SOLR-13105
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Joel Bernstein
>            Assignee: Joel Bernstein
>            Priority: Major
>         Attachments: Screen Shot 2019-01-14 at 10.56.32 AM.png, Screen Shot 
> 2019-02-21 at 2.14.43 PM.png, Screen Shot 2019-03-03 at 2.28.35 PM.png, 
> Screen Shot 2019-03-04 at 7.47.57 PM.png, Screen Shot 2019-03-13 at 10.47.47 
> AM.png, Screen Shot 2019-03-30 at 6.17.04 PM.png
>
>
> Visualization is now a fundamental element of Solr Streaming Expressions and 
> Math Expressions. This ticket will create a visual guide to Solr Math 
> Expressions and Solr Streaming Expressions that includes *Apache Zeppelin* 
> visualization examples.
> It will also cover using the JDBC expression to *analyze* and *visualize* 
> results from any JDBC compliant data source.
> Intro from the guide:
> {code:java}
> Streaming Expressions exposes the capabilities of Solr Cloud as composable 
> functions. These functions provide a system for searching, transforming, 
> analyzing and visualizing data stored in Solr Cloud collections.
> At a high level there are four main capabilities that will be explored in the 
> documentation:
> * Searching, sampling and aggregating results from Solr.
> * Transforming result sets after they are retrieved from Solr.
> * Analyzing and modeling result sets using probability and statistics and 
> machine learning libraries.
> * Visualizing result sets, aggregations and statistical models of the data.
> {code}
>  
> A few sample visualizations are attached to the ticket.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

