Re: [CODE4LIB] Keyword Extraction from Text

2019-09-17 Thread Chris Moschini
We use Google Universal Sentence Encoder which operates on TensorFlow. https://tfhub.dev/google/universal-sentence-encoder/1 It's impressive because it can handle multi-word "keywords", which ends up being forgotten a lot in planning, but critical in actual use. It's pretrained on several languag

Re: [CODE4LIB] Keyword Extraction from Text

2019-09-17 Thread Mark Watkins
It does depend a bit on what kinds of "key terms" or "important words" you have in mind, but I have had good luck with Google's NLP APIs. They free for small numbers of queries (if memory serves in the thousands per day, but don't quote me on it). It does a good job of identifying people, places

Re: [CODE4LIB] Keyword Extraction from Text

2019-09-17 Thread Jessica Milby
Hi Athina-- A while back I used a program called antconc (https://www.laurenceanthony.net/software/antconc/) to do textual analysis for public-facing museum content. It was incredibly helpful even though it doesn't look like much...especially if you don't know python or want to learn it. It's

Re: [CODE4LIB] Keyword Extraction from Text

2019-09-16 Thread Eric Lease Morgan
On Sep 16, 2019, at 12:20 PM, Athina Livanos-Propst wrote: > I'm starting to think around a project that would involve key terms from > other types of text (transcripts, captions, documents). I'm basically trying > to build a tool that I can use to extra key terms from larger strings of > tex

Re: [CODE4LIB] Keyword Extraction from Text

2019-09-16 Thread Kyle Banerjee
Hi Athina, The extractors are very different in terms of what they're optimized to work with and what they're designed to extract -- you need one designed for your purposes, and you may need more than one. A few years back, I experimented with a number of extractors before settling on Alchemy a

Re: [CODE4LIB] Keyword Extraction from Text

2019-09-16 Thread Allen Kwan
onday, September 16, 2019 12:27 PM > To: CODE4LIB@LISTS.CLIR.ORG > Subject: Re: [CODE4LIB] Keyword Extraction from Text > > Hi Athina, > Voyant is a good tool for getting started > (https://urldefense.proofpoint.com/v2/url?u=https-3A__voyant-2Dtools.org_&d=DwIFAg&c=QX2Of

Re: [CODE4LIB] Keyword Extraction from Text

2019-09-16 Thread Athina Livanos-Propst
To: CODE4LIB@LISTS.CLIR.ORG Subject: Re: [CODE4LIB] Keyword Extraction from Text Hi Athina, Voyant is a good tool for getting started (https://urldefense.proofpoint.com/v2/url?u=https-3A__voyant-2Dtools.org_&d=DwIFAg&c=QX2OfGk7aRC3kh1nmtbeQQ&r=RXIBTdlDWSmqO0o3MLQ-BLQUZFYg5gjAQ

Re: [CODE4LIB] Keyword Extraction from Text

2019-09-16 Thread Pikas, Christina K.
Baltimore: 443.778.4812 D.C.: 240.228.4812 christina.pi...@jhuapl.edu -Original Message- From: Code for Libraries On Behalf Of Athina Livanos-Propst Sent: Monday, September 16, 2019 12:21 PM To: CODE4LIB@LISTS.CLIR.ORG Subject: [CODE4LIB] Keyword Extraction from Text Hi everyone, I'm sta

[CODE4LIB] Keyword Extraction from Text

2019-09-16 Thread Athina Livanos-Propst
Hi everyone, I'm starting to think around a project that would involve key terms from other types of text (transcripts, captions, documents). I'm basically trying to build a tool that I can use to extra key terms from larger strings of text, i.e. pull out the important words from a larger sente