## Description:
- Apache Samza is a distributed stream processing engine that is highly
configurable to process events from various data sources, including
real-time messaging systems (e.g. Kafka) and distributed file systems
(e.g. HDFS).
## Issues:
- No issues require board attention.
## Activity:
- Samza 1.0 was released:
- News coverage:
https://www.zdnet.com/article/real-time-data-processing-just-got-more-options-linkedin-releases-apache-samza-1-0-streaming/
- Engineering blogs:
https://engineering.linkedin.com/blog/2018/11/samza-1-0--stream-processing-at-massive-scale
- Major online website refresh: http://samza.apache.org/
- Critical improvement projects completed:
- Changelog restore parallelization
- Evaluated HDFS based backup/restore of state stores
- Multiple SEP projects initiated or in-progress:
- SEP-18: Manipulation of starting offsets and time-based rewind
- SEP-19: Fast failover for stateful jobs on container failure (i.e.
standby containers)
- SEP to come soon: async high-level API
- Beam Samza runner upgrade to use Samza 1.0
- Go and Python support via Beam Samza runner
## Health report:
- The project is healthy, with 1.0 released in Nov 2018.
## PMC changes:
- Currently 15 PMC members.
- Prateek Maheshwari was added to the PMC on Thu Nov 01 2018
## Committer base changes:
- Currently 22 committers.
- New committers:
- Aditya Toomula was added as a committer on Mon Nov 05 2018
- Hai Lu was added as a committer on Mon Nov 05 2018
## Releases:
- Last release was 1.0 on Nov 28, 2018
## /dist/ errors: 9
- Project is in healthy status with a major release pending in Oct
## Mailing list activity:
- [email protected]:
- 271 subscribers (down 13 in the last 3 months):
- 445 emails sent to list (288 in previous quarter)
## JIRA activity:
- 111 JIRA tickets created in the last 3 months
- 57 JIRA tickets closed/resolved in the last 3 months