[
https://issues.apache.org/jira/browse/SOLR-13105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17261590#comment-17261590
]
Cassandra Targett commented on SOLR-13105:
------------------------------------------
I put all the comments together to respond to them.
Re: 1, 5, 7, 11, 12, 14, 15 - I don't know where the datasets came from so I
feel like that shouldn't hold up merging this. They can be added later.
1. enron_emails emails collection, wish we could follow along with that dataset.
2. math-start.adoc line 105 add link to Zeppelin? It's not the most well known
project!
-- I don't know why this needs a 2nd opinion. Why would we ever not link
somewhere if it's helpful for the reader?
3. check math-start.adoc line 121 the parameters!
-- ?? line 121 is a section heading. If you know what's wrong, go ahead and fix
it?
4. Any thoughts on putting all the various .adoc files in a sub dir? So many of
them! So we know `loading.adoc` isn't a generic solr loading help page!
-- Answered in previous comment
5. loading.adoc - wish we had the iris.csv data file as well to follow along,
plus to easily test the queries!
6. Should all field names be wrapped with stars? I.e. `filesize_d` -->
*`filesize_d`*
-- I'm fixing some/most of these as I go along but it's really tedious so no
reason I have to be the only person who does so. Whenever something's wrong,
fix it.
7. search-sample.adoc, line 177, and would like the nyc311 dataset as well!
8. variables.adoc, line 206 appears awkward. Strike the "can be cached in-memory
for future use."
-- So, you can fix it?
9. variables.adoc, sometimes a variable is wrapped in both code comments and
bold, and sometimes not. In other places, we appear to just wrap them in bold
tags. Also in term-vectors.adoc
-- It's all over the place, not just these files. I'm fixing some as I catch
them, but it's tedious as hell so how about you also go ahead and fix some.
10. matrix-math.adoc line 127, starting here, we introduce some visualizations
in Apache Zeppelin that are not out-of-the-box Zeppelin ones. Would be nice to
know how to recreate this visualization! How do I add the heat map? Maybe a
separate callout on "How I set up Zeppelin" would be useful.
-- A separate page for Zeppelin is a good idea. Again, I don't think it needs
to hold this up specifically, so go ahead if you'd like or I might eventually
get to it.
11. term-vectors.adoc We introduce some movie reviews. again, would love to
have the sample data!
12. statistics.adoc We introduce the logs collection (I think it's sourced from
Solr), again, would love to have a callout on how to set it up myself!
13. In statistics.adoc, we refer to stock ticker symbols. Should they be
all caps? Isn't amzn always AMZN?
-- Yeah, but sometimes it depends a bit on context - if putting it in caps
would break an example for some reason lower case would be correct. Needs some
more review.
14. statistics.adoc introduces a stock prices collection. Guess what I'd like
15. machine-learning.adoc introduces a wine collection. Yep, I am a broken
record
> A visual guide to Solr Math Expressions and Streaming Expressions
> -----------------------------------------------------------------
>
> Key: SOLR-13105
> URL: https://issues.apache.org/jira/browse/SOLR-13105
> Project: Solr
> Issue Type: New Feature
> Reporter: Joel Bernstein
> Assignee: Joel Bernstein
> Priority: Major
> Attachments: Screen Shot 2019-01-14 at 10.56.32 AM.png, Screen Shot
> 2019-02-21 at 2.14.43 PM.png, Screen Shot 2019-03-03 at 2.28.35 PM.png,
> Screen Shot 2019-03-04 at 7.47.57 PM.png, Screen Shot 2019-03-13 at 10.47.47
> AM.png, Screen Shot 2019-03-30 at 6.17.04 PM.png
>
>
> Visualization is now a fundamental element of Solr Streaming Expressions and
> Math Expressions. This ticket will create a visual guide to Solr Math
> Expressions and Solr Streaming Expressions that includes *Apache Zeppelin*
> visualization examples.
> It will also cover using the JDBC expression to *analyze* and *visualize*
> results from any JDBC compliant data source.
> Intro from the guide:
> {code:java}
> Streaming Expressions exposes the capabilities of Solr Cloud as composable
> functions. These functions provide a system for searching, transforming,
> analyzing and visualizing data stored in Solr Cloud collections.
> At a high level there are four main capabilities that will be explored in the
> documentation:
> * Searching, sampling and aggregating results from Solr.
> * Transforming result sets after they are retrieved from Solr.
> * Analyzing and modeling result sets using probability and statistics and
> machine learning libraries.
> * Visualizing result sets, aggregations and statistical models of the data.
> {code}
>
> A few sample visualizations are attached to the ticket.