Denis, I think it is fine to have PLSA in MLlib. But I'm not familiar with the modification you mentioned since the paper is new. We may need to spend more time to learn the trade-offs. Feel free to create a JIRA for PLSA and we can move our discussion there. It would be great if you can share your current implementation. So it is easy for developers to join the discussion.
Jayati, it is certainly NOT mandatory. But if you want to contribute something new, please create a JIRA first. Best, Xiangrui