Attached are modifications to the Quarterly Report (thank you Dave!).
Any more thoughts? Anyone?
Answer to Dave's question: Have any candidates for committer or PPMC
> membership been seen anywhere?
No, not yet.
Lee.
On Thu, Jan 16, 2020 at 8:21 AM Dave Fisher <[email protected]> wrote:
> Hi -
>
> Items #1 and #2 on what to improve are really the same. A proper #2 should
> be to have more substantive technical discussions on the dev@datasketches
> mailing list rather than a Google group. The mailing list should be the
> most prominent way to connect with the project and not the least (according
> to the Group page)
>
> Please include information in the report about activity levels on slack
> and google groups.
>
> Have any candidates for committer or PPMC membership been seen anywhere?
> If so then please start a discussion on private@. If not then a
> discussion on dev@ should proceed about what the project is looking for
> and how someone could earn merit.
>
> Regards,
> Dave
>
> > On Jan 15, 2020, at 6:45 PM, leerho <[email protected]> wrote:
> >
> > Folks,
> > This is my first cut at a draft report that needs to be completed by Feb
> 5th End-of-day.
> >
> > Please, please -- feel free to edit, revise or add comments.
> >
> > Whimsy and other tool expect the report to formatted in a certain way so
> please make sure that you try to:
> > • Keep all lines under 76 characters long.
> > • All content under the ### headings should be indented by two
> spaces. Do not use tabs.
> > • Please don't change the text in the headings or add new ones.
> > • Include a space after a bullet point or full stop on a
> numbered list.
> > • Use [X] (X and no spaces) to sign off reports.
> >
> > It might be easier if you give me the section and text you want to edit
> or add to and I will merge it into a master markdown.
> >
> > Thanks!
> >
> > Lee.
> >
> > <QR_2020-02-05.md>
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [email protected]
> > For additional commands, e-mail: [email protected]
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>
## DataSketches 19 Feb 2020
DataSketches is an open source, high-performance library of stochastic
streaming algorithms commonly called "sketches" in the data sciences.
Sketches are small, stateful programs that process massive data as a stream
and can provide approximate answers, with mathematical guarantees, to
computationally difficult queries orders-of-magnitude faster than
traditional, exact methods.
DataSketches has been incubating since 2019-03-30.
### Three most important unfinished issues to address before graduating:
1. Finish migrating old website to Apache destination and update with
the new logo.
2. We need to have more substantive discussions on our dev@ list especially about our growing
TODO list and how we plan to address them.
3. Find / Attract new committers outside Yahoo!
### Are there any issues that the IPMC or ASF Board need to be aware of?
No
### How has the community developed since the last report?
We are presenting at more conferences which has attracted some interest.
We are definitely getting more traffic on our forum, GitHub issues
and email lists. We recently added two channels on the-asf@slack: #datasketches
and #datasketches-dev. The traffic has been fairly low on Slack as well as
the forum. We could do more to publicize the slack channels. I could be
optimistic and believe the low traffic is due to the holidays -- or that the
code just works :)
Nonetheless, the download traffic measured by repository.a.o
has grown exponentially since our first Apache release on Sep 23. We are over 1000
unique IPs/ month and had a recent high of 22K downloads/ month. Bear in mind
that this is all traffic that has migrated from the older, pre-Apache artifacts
at com.yahoo.datasketches and is already higher than our peak downloads prior to
Apache. These numbers also do not reflect any downloads of our Zip artifacts
from a.o./dist (which includes our C++ artifacts) or other external download
repositories (for example, specific to PostgreSQL).
### How has the project developed since the last report?
Our releases are becoming easier, more polished and routine.
Nonetheless, our website needs a lot of work (as mentioned above) and this will
become our focus for the next month or so.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [ ] Initial setup
- [ ] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
These are the major components and their last release dates:
* DataSketches-Java 2019-10-10
* DataSketches-Memory 2019-11-21
* DataSketches-CPP 2019-09-17
* DataSketches-Hive 2019-10-11
* DataSketches-Pig 2019-10-18
* DataSketches-Postgresql 2019-10-29
### When were the last committers or PPMC members elected?
No new committers since April, 2019.
### Have your mentors been helpful and responsive?
Yes.
Are things falling through the cracks? If so, please list any
open issues that need to be addressed.
No open issues other than our website troubles :(
### Is the PPMC managing the podling's brand / trademarks?
To the best of our knowledge, yes.
* Are 3rd parties respecting and correctly using the podlings name and brand?
As far as we know, yes.
* If not what actions has the PPMC taken to correct this?
We have not had to face this issue yet.
* Has the VP, Brand approved the project name?
Yes, and it is clearly stated as such on
http://incubator.apache.org/projects/datasketches.html
### Signed-off-by:
- [ ] (datasketches) Liang Chen
Comments:
- [ ] (datasketches) Kenneth Knowles
Comments:
- [ ] (datasketches) Furkan Kamaci
Comments:
- [ ] (datasketches) Dave Fisher
Comments:
- [ ] (datasketches) Evans Ye
Comments:
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]