Dear Apache Jena Community,

I am Craig Turrell, the Head of Design and AI at Standard Chartered. My
team and I have been using Apache Jena extensively for semantic web
applications and appreciate its robust foundation for RDF data management.

After working with Jena on multiple large-scale knowledge graph projects,
we've identified several areas where we believe we could contribute
meaningful architectural improvements to help Jena better serve modern,
AI-integrated semantic web applications. We have researched the current
limitations in these areas, along with potential solutions, and we're
excited about the possibility of working with the community to implement
enhancements.

The key areas we're interested in addressing include:

   1. *Horizontal scaling beyond single-machine constraints* - Extending
   Jena's capabilities for truly distributed datasets by building upon
   existing work like RDF-Delta and Fuseki-Kafka
   2. *Memory management optimizations for large datasets* - Improving
   handling of billion-triple graphs through enhanced encoding, caching, and
   integration with formats like HDT
   3. *Enhanced reasoning capabilities* - Extending support for more
   expressive ontology reasoning and building on the recent Jena-ontapi module
   4. *Native vector support for AI integration* - Adding capabilities for
   embedding storage and similarity search to bridge RDF and vector-based AI
   models

Based on our research into successful contribution patterns in Apache
projects, we're proposing a phased approach, starting with incremental
improvements to horizontal scaling while maintaining backward
compatibility. We understand the importance of community involvement,
comprehensive testing, and thorough documentation.

We would greatly appreciate your feedback on:

   - Whether these areas align with the current project roadmap and
   community interests
   - Suggestions for which specific limitations would be most valuable to
   address first
   - Guidance on the best way to proceed with initial contributions
   - Any existing work or discussions related to these areas that we should
   be aware of

I've reviewed the contribution guidelines and am ready to create GitHub
issues and submit PRs for specific improvements once we have better
alignment with the community's priorities.

Thank you for your consideration. We look forward to your thoughts and to
potentially becoming active contributors to this important project.

Best regards,

Craig Turrell

craig.turr...@sc.com
