Good job Ash! Really useful! On Wed, Apr 14, 2021 at 10:22 PM Ash Berlin-Taylor <a...@apache.org> wrote:
> Hi Tao, > > It's written with being able to support multiple workspaces in mind, but > the code may not work 100%, so I'll need to take a look :) > > Roughly it's get a data export as an admin, and then install the slack app > in your workspace, and then I make some config changes on the install. > > Ping me if I don't get back to you in a week :) > > -ash > > On Wed, 14 Apr, 2021 at 10:48, Tao Feng <fengta...@gmail.com> wrote: > > Hey Ash, > > That's great! I am interested in using for my other open source project ( > https://github.com/amundsen-io/amundsen) which is currently hitting the > 10k message limit as well. I wonder what it takes to setup/enable for other > slack workspace? > > Thanks, > -Tao > > On Wed, Apr 14, 2021 at 5:40 AM Ash Berlin-Taylor <a...@apache.org> wrote: > >> Hello everyone, >> >> Thanks to prompting from Sumit, I have "resurrected" a project I started >> back in 2019, and have got searchable slack archives available: >> >> https://apache-airflow.slack-archives.org/ >> >> (This is a fancy looking URL, there is nothing else on the domain yet. >> Any other projects want this too?) >> >> A little known fact of Slack is that the export an admin can do contains >> *all* messages, not just the ones the client will show, so this has all >> 130k+ messages in the DB. For example >> https://apache-airflow.slack-archives.org/announcements/page-1 for >> example shows the very first message from Fokko >> >> This service also has a bot user, called Archie the Archive Bot, that if >> invited to channels will listen for messages (and deletions/edits etc). To >> get this bot in the channel we need to run /invite @archie -- I'm not >> sure if only workspace admins can do this or if anyone can. >> >> Features that are missing/broken/confusing right now: >> - Thread replies aren't handled visually correctly -- rather than being >> nested under the original message they are just shown like a normal message. >> - No ability to permalink to a specific message >> - Markdown formatting in messages might be incomplete >> - Shared files/images may or may not be accessible. I haven't really >> tested it. >> - It needs a privacy policy/data protection statement >> - Since it's using VueJS (the project I forked this froms choice, not >> mine) it's probably not indexable by search engines. >> - The front end is only showing 10k messages per channel ( >> https://apache-airflow.slack-archives.org/random/page-1) -- given it's >> paged already there's no need for this limitation to exist. >> - Links to users don't go anywhere sensible. >> >> I'm sure there are many more small gotchas, but I didn't want perfect to >> be the enemy of the good. >> >> The code for this service lives at https://github.com/ashb/slackarchive >> - PRs welcome ;) Most of the readme is still wrong there. >> >> If anyone isn't happy with this I can delete it, or set certain channels >> to not be archived etc. >> >> -ash >> > -- +48 660 796 129