chiruu12 commented on issue #183: URL: https://github.com/apache/incubator-hugegraph-ai/issues/183#issuecomment-2689160214
Also, could you tell me whether we will be using an in-house trained LLM or running inference through an external service? If we're using inference endpoints, we could use Groq's API to make the agent faster, since it runs on LPUs. Alternatively, we could fine-tune a smaller model specifically for our RAG tasks and serve it through Hugging Face Inference Endpoints, which could provide a good balance of performance, accuracy, and cost-efficiency. I also have a few candidate models in mind; please let me know if you would be open to discussing them.
