Re: [CODE4LIB] tar without compression? General comments welcome

2023-09-21 Thread Stuart A. Yeates
Depending on your platform and context, you may want to explicitly use the --selinux --acls --xattrs options. cheers stuart -- ...let us be heard from red core to black sky On Fri, 22 Sept 2023 at 07:32, Lolis, John wrote: > Long live tar! *T*ape *AR*chiving may be dead, but the command it ha

Re: [CODE4LIB] [External] Re: [CODE4LIB] What manner of creature is LaTeX?

2023-07-20 Thread Stuart A. Yeates
n't see any way you could separate those > format blocks from the text to be blocked out. Am I misunderstanding you? > > Steve McDonald > steve.mcdon...@tufts.edu > > > > -Original Mes

Re: [CODE4LIB] What manner of creature is LaTeX?

2023-07-20 Thread Stuart A. Yeates
LaTeX is a little like PostScript and Excel with autorun scripts: formats conceived and developed prior to the software development insight that separation of content and code need to be separate. Nowadays it is accepted that content should be split into text and style, but way back when, there wa

Re: [CODE4LIB] Searching Gmail by Star next to email

2022-05-08 Thread Stuart A. Yeates
label:starred appears to be your friend. See also label:unread, etc. cheers stuart -- ...let us be heard from red core to black sky On Mon, 9 May 2022 at 11:02, charles meyer wrote: > > I've Googled this issue but Gmail Help support page doesn't solve the > problem. > > You receive an approve

Re: [CODE4LIB] OJS cloud hosting

2022-03-10 Thread Stuart A. Yeates
https://pkpservices.sfu.ca/ are the people who developed OJS. I've not used them for hosting but had very positive interactions with them for debugging / fixing / patching stuff. They're also a unit of a Canadian university rather than a for-profit company. cheers stuart -- ...let us be heard from

Re: [CODE4LIB] It’s Official! Fedora 6.0 Production Release is Available

2021-07-08 Thread Stuart A. Yeates
Comms tip: When writing an announcement you hope is going to be widely circulated, make it clear less than three paragraphs in which of the two confusingly-similarly-named software projects you're writing about. cheers stuart -- ...let us be heard from red core to black sky On Fri, 9 Jul 2021 at

Re: [CODE4LIB] Digitization Outsourcing Survey

2021-06-23 Thread Stuart A. Yeates
How are you going to present answers to questions such as "In your opinion, what is the most interesting item or collection that your library has digitized?" anonymously, given that these are highly likely to be unique? "15. Top 3: Considering vendors and other partners that you have worked with,

Re: [CODE4LIB] facial hair names

2021-03-30 Thread Stuart A. Yeates
any pages are in the book’s > preface. In an attempt to make progress on a “read a book from every country > in the world” challenge I coded > https://deborahfitchett.com/toys/aroundtheworld/ using Wikidata but… needs > more data. > > Deborah > > From: Code for Libraries

Re: [CODE4LIB] facial hair names

2021-03-30 Thread Stuart A. Yeates
ah > Sent: Monday, March 29, 2021 7:46 PM > To: CODE4LIB@LISTS.CLIR.ORG<mailto:CODE4LIB@LISTS.CLIR.ORG> > Subject: Re: [CODE4LIB] facial hair names > > dc.coverage.facial > > From: Code for Libraries > mailto:CODE4LIB@LISTS.CLIR.ORG>> On Behalf Of Stuart > A. Yea

[CODE4LIB] facial hair names

2021-03-28 Thread Stuart A. Yeates
The CDC has released a list of facial hair names https://www.cdc.gov/niosh/npptl/pdfs/FacialHairWmask11282017-508.pdf If we want to use these for facetting, which metadata fields should we be using? cheers stuart -- ...let us be heard from red core to black sky

Re: [CODE4LIB] Moving away from handle service

2021-03-01 Thread Stuart A. Yeates
dle destination. Then > > throw > > a CloudFront distribution in front of it, change the DNS, and call it a > > day. > > You’ll spend a couple bucks a month for storage and bandwidth, but the > > ultimate control stays with the institution. > > > >

[CODE4LIB] Moving away from handle service

2021-03-01 Thread Stuart A. Yeates
My institution has used handles for more than a decade and would like to stop (non-standard ports, special server, etc), particularly as we're now committed to DOIs. However, we don't want to break URLs. Does anyone know of a third party service that we can hand a list of handle-to-URL-mappings a

Re: [CODE4LIB] Web app to search XML files

2020-12-17 Thread Stuart A. Yeates
There's XML and XML. I suggest that you enquire about the exact format that you're going to be receiving and ask around for systems that support it out of the box. cheers stuart -- ...let us be heard from red core to black sky On Fri, 18 Dec 2020 at 07:37, Pennington, Buddy D. wrote: > > Hi a

Re: [CODE4LIB] to stop word, or not to stop word. that is the question

2020-07-10 Thread Stuart A. Yeates
Sounds like a classical use for the tf–idf measure. For those with no background in information retrieval, see https://en.wikipedia.org/wiki/Tf%E2%80%93idf cheers stuart -- ...let us be heard from red core to black sky On Sat, 11 Jul 2020 at 06:58, Eric Lease Morgan wrote: > > To stop word, o

Re: [CODE4LIB] Robert Sandusky

2020-04-13 Thread Stuart A. Yeates
After allowing time for obits to appear, I would encourage you to consider writing a wikipeda article about Sandusky. Most late career academics meet the criteria. If you need help with the mechanics of writing I'm more than willing to help, but start with compling secondary sources. cheers stuart

Re: [CODE4LIB] open journal systems and oai

2020-03-12 Thread Stuart A. Yeates
red core to black sky On Fri, 13 Mar 2020 at 09:43, Eric Lease Morgan wrote: > > On Mar 12, 2020, at 3:44 PM, Stuart A. Yeates wrote: > > >> Where can I find a list of Open Journal System (OJS) journals and their > >> associated OAI-PMH data repository root UR

Re: [CODE4LIB] open journal systems and oai

2020-03-12 Thread Stuart A. Yeates
I believe that many years ago there was a comprehensive list published by the PKP (who produce OJS) based on the homing signals OJS uses to check for updates. That's gone now (privacy reasons?), DOAJ is a great place to start, but many journals don't qualify for DOAJ. cheers stuart -- ...let us

Re: [CODE4LIB] WARC --> static HTML?

2020-03-04 Thread Stuart A. Yeates
WARC is not an access format. WARC is entirely optimised for crawling and the gold standard for archiving because it's close to the 'on the wire' web experience. BUT There is no file index: you access every file using a linear search from the start of the archive. There is no guarantee that rela

Re: [CODE4LIB] Mobile access to IP-authenticated resources

2020-02-20 Thread Stuart A. Yeates
Depending on what level of control you have, you could potentially do some DNS black magic: have a VPN that sent all client requests for *.ebscohost.com to server.yourdomain.edu which then used httpd redirects (or similar) to rewrite those to include the proxy prefix. You'd need to include an appro

[CODE4LIB] https://portal.issn.org/

2020-01-23 Thread Stuart A. Yeates
My understanding is that the database at https://portal.issn.org/ contains the official information for all registered ISSNs and that while it's free to access on a record-by-record basis, they charge for API access and don't permit the redistribution of the results (and goes to great length to tal

Re: [CODE4LIB] ode to fred kilgour

2020-01-06 Thread Stuart A. Yeates
If you know the original photographer and are in contact with them, would it be possible to get them to upload a copy to https://commons.wikimedia.org/wiki/Special:UploadWizard for use in https://en.wikipedia.org/wiki/Fred_Kilgour ? Ideally get the original uploaded for provenance purposes, the wi

Re: [CODE4LIB] Systematic / Systemic bias in bibliometics

2019-07-17 Thread Stuart A. Yeates
gt; ORCiD: https://orcid.org/-0001-5059-4132 > > > On 2019/07/17, 1:30 PM, "Code for Libraries on behalf of Stuart A. Yeates" > wrote: > > I'm looking for work or discussions on systematic bias in > bibliometrics or appropriate fora wh

[CODE4LIB] Systematic / Systemic bias in bibliometics

2019-07-17 Thread Stuart A. Yeates
I'm looking for work or discussions on systematic bias in bibliometrics or appropriate fora where such discussions are likely to happen. Even critical analysis of the founding assumptions of bibliometrics as a field would be a good place to start I have some ideas but they seem obvious and I'm afr

Re: [CODE4LIB] From the Community Support Squad wrt "Note [admiistratativia]"

2019-07-14 Thread Stuart A. Yeates
I was personally ambivalent about anonymity on the mailing list. However, the fact that it appears to be predominantly men arguing for banning anonymity and women arguing for allowing it is a tell that us men folk might have our lower appendages in our orifices. cheers stuart -- ...let us be hear

Re: [CODE4LIB] Note [administratativia]

2019-06-30 Thread Stuart A. Yeates
In the spirit of fixing actual things that need to be fixed, I'd like to point out the really positive news out of the ALA about them recognising Dewey https://www.theguardian.com/books/2019/jun/27/melvil-deweys-name-stripped-from-top-librarian-award and AAAS about them recognising issues with curr

Re: [CODE4LIB] code4lib journal [reader]

2019-06-13 Thread Stuart A. Yeates
gan wrote: > > On Jun 12, 2019, at 8:40 PM, Stuart A. Yeates wrote: > > >> The Distant Reader [0] harvests an arbitrary number of user-supplied files > >> or links to files, transforms them into plain text files, and performs > >> numerous natural language proce

Re: [CODE4LIB] code4lib journal

2019-06-12 Thread Stuart A. Yeates
Taking a look at distant reader (which I don't believe I've looked at before): (a) It would be great to sanity-check the corpus by running language identification on each of the files (b) There are a whole flotilla of technical identifiers that could useful be extracted from the text files (DOIs,

Re: [CODE4LIB] Ready for Review: OCFL 0.3 (Beta)

2019-06-04 Thread Stuart A. Yeates
Is there a clear statement of the problem OCFL is trying to solve? I'm a third of the way through and it looks like METS with JSON replacing XML and all references to existing metadata schemas (MARC, dublin core, etc) stripped. cheers stuart -- ...let us be heard from red core to black sky On Wed

Re: [CODE4LIB] altmetric

2019-04-22 Thread Stuart A. Yeates
Altmetic is great for capturing social media buzz. Unfortunately it has a very narrow selective view of impact, which means that many classes of publication aren't captured. It also has no notion of conflict of interest, so institutions which have twitter bots connected to their institutional rep

Re: [CODE4LIB] AuthorityBox

2019-04-16 Thread Stuart A. Yeates
That's great! I've done a lot of work with VIAF, ORCID, etc in wikipedia and one of the huge issues is with multiple clusters for the same person. The wiki page for the list of VIAF errors is so long it no longer loads on some of my browsers... cheers stuart -- ...let us be heard from red core to

Re: [CODE4LIB] Online data transformation tools

2019-04-15 Thread Stuart A. Yeates
I would also add somewhere links to the definitions / standards for each of these files types. Not everyone who encounters MARC can be expected to know all the other acronyms-as-file-formats. cheers stuart -- ...let us be heard from red core to black sky On Tue, 16 Apr 2019 at 08:58, Kyle Baner

[CODE4LIB] Browzine / thirdiron off campus authentication issue after cookies cleared?

2018-11-05 Thread Stuart A. Yeates
Are there any other Browzine / thirdiron (https://thirdiron.com/) users out there who are seeing an issue with browzine forgetting which institution it's meant to be authenticating against when used by off campus users who have not previously logged into our institutional systems with this browser?

Re: [CODE4LIB] exact relationship between DOIs and handles?

2018-08-19 Thread Stuart A. Yeates
fier as a Handle, and not first validating that the Handle is > actually a valid DOI. I'd regard that as a bug in the DOI's resolver, > personally. > > Cheers! > > Conal > > > On Fri, 17 Aug 2018 at 09:37, Stuart A. Yeates syea...@gmail.com>

Re: [CODE4LIB] exact relationship between DOIs and handles?

2018-08-16 Thread Stuart A. Yeates
't be too hard to add. cheers stuart -- ...let us be heard from red core to black sky On Fri, 17 Aug 2018 at 14:20, Stuart A. Yeates wrote: > Interesting insight Conal, I wasn't aware of that service. > > https://doi.org/10063/1710 redirects to > http://researcharchive.

Re: [CODE4LIB] exact relationship between DOIs and handles?

2018-08-16 Thread Stuart A. Yeates
t; actually a valid DOI. I'd regard that as a bug in the DOI's resolver, > personally. > > Cheers! > > Conal > > > On Fri, 17 Aug 2018 at 09:37, Stuart A. Yeates wrote: > > > We have a DSpace instance that is configured to issue handle.net &g

[CODE4LIB] exact relationship between DOIs and handles?

2018-08-16 Thread Stuart A. Yeates
We have a DSpace instance that is configured to issue handle.net identifiers to all items, so links such as: https://researcharchive.vuw.ac.nz/handle/10063/1710 http://researcharchive.vuw.ac.nz/handle/10063/1710 https://hdl.handle.net/10063/1710 http://hdl.handle.net/10063/1710 all take a web bro

Re: [CODE4LIB] What DAMS does your institution use, and why?

2018-03-24 Thread Stuart A. Yeates
We're in the process of rolling out Rosetta from ExLibris / Proquest, almost solely based on it's compliance with archival standards. cheers stuart -- ...let us be heard from red core to black sky On 24 March 2018 at 09:09, Turner, Steven wrote: > Hi everyone - At The University of Alabama Lib

Re: [CODE4LIB] BIBFRAME nesting question

2018-01-18 Thread Stuart A. Yeates
> I > haven't thought this through but because BF combines the FRBR work and > expression into a single entity, it may be safe to say that no BF > instance can be an instanceOf more than one BF work. Isn't every edition of 'Complete Works of Shakespeare' an instanceOf each of the plays? cheers st

Re: [CODE4LIB] c4l Journal unreachable

2018-01-12 Thread Stuart A. Yeates
In the meantime: http://web.archive.org/web/20171202212012/http://journal.code4lib.org/articles/5468 cheers stuart -- ...let us be heard from red core to black sky On 13 January 2018 at 07:05, Karen Coyle wrote: > The link: > > http://journal.code4lib.org/ > > goes to https://www.ibiblio.org/

Re: [CODE4LIB] curating code4lib

2017-12-12 Thread Stuart A. Yeates
Someone is on the list is bound to have extra megabytes left on their archive.org sub at the end of the period. Maybe we could have a wiki page describing the best crawl config so nothing gets left out? Remember that re-crawling the same content doesn't incur a cost... cheers stuart -- ...let us

Re: [CODE4LIB] Give your opinion: PREMIS in METS Toolbox

2017-11-05 Thread Stuart A. Yeates
For those of you blissfully unaware of what PREMIS in METS is / looks like (as I confess I was a year ago), take a look at https://raw.githubusercontent.com/stuartyeates/METS_metadata_for_complete_beginners/master/thesis-exported/ie.xml (search for 'premis:'). cheers stuart -- ...let us be heard

Re: [CODE4LIB] Online suggestion form or box

2017-10-24 Thread Stuart A. Yeates
We're using https://www.zendesk.com/ for end user feedback. It's great. cheers stuart -- ...let us be heard from red core to black sky On 25 October 2017 at 09:04, Jason Bengtson wrote: > I've used a variety of them in the past for projects . . . often through > google forms. I haven't done it

Re: [CODE4LIB] Persistent Identifiers for organizations/institutions.

2017-10-14 Thread Stuart A. Yeates
On 14 October 2017 at 07:11, Edward Summers wrote: > What if we created an identifier system that organizations would pay an > annual feel to belong to? This identifier would be guaranteed to be > globally unique as long as the organization cared to maintain it. You could > use this identifier wi

Re: [CODE4LIB] Needed: some metadata documentation examples

2017-10-11 Thread Stuart A. Yeates
Do you mean something like http://rioxx.net/ ? cheers stuart -- ...let us be heard from red core to black sky On 11 October 2017 at 10:20, Karen Coyle wrote: > Hello, all. I'm working on some projects where we are trying to define > formats and guidance for metadata *profiles*. You may have se

Re: [CODE4LIB] SIP/AIP Content Guidelines

2017-06-02 Thread Stuart A. Yeates
We're currently migrating a number of in-house and DSpace systems into SIPs for ingest into Rosetta. My advice is to put as much metadata as possible into the METS. For example, just this week we seriously discussed putting sushi sections in at the item level. Previously sushi was done by very dif

[CODE4LIB] authority files optimised for linked open data?

2017-05-25 Thread Stuart A. Yeates
I'm currently scripting the conversion of >10 authority records into MARC. I have quite a bit of sundry information about some of the people and I'd like encode this in a way which makes it optimal for use / reuse as LOD. Does anyone have any pointers or examples? cheers stuart -- ...let us b

Re: [CODE4LIB] File name extension in bitstream URL for preservation harvesting

2017-04-24 Thread Stuart A. Yeates
Quality digital preservation software will ignore file extensions and file names for file type identification and use JHOVE or similar. Cheers Stuart On Tuesday, April 25, 2017, Benedikt Kroll wrote: > That would work, but I'm rather trying to find out whether digital > preservation softwares h

Re: [CODE4LIB] Google Marking Code4Lib As Spam

2017-01-24 Thread Stuart A. Yeates
Most SPAM filters use distributed feedback as an important feedback loop. This means that if individuals on the list want to do something about this, they can check their SPAM box for code4lib messages and mark them as not spam. To find these enter "in:spam code4lib" into the gmail search box. ch

Re: [CODE4LIB] How to archive selected pages from a site requiring authentication

2017-01-18 Thread Stuart A. Yeates
https://archive-it.org/ the subscription service of https://archive.org/ does login-protected sites. We've found them to be very helpful and the software to just work, but we've never done any password protected sites. cheers stuart -- ...let us be heard from red core to black sky On Thu, Jan 1

Re: [CODE4LIB] synonomizer

2017-01-17 Thread Stuart A. Yeates
If anyone is keen, another likely source of synonyms is Wikipedia redirects. https://www.mediawiki.org/wiki/API:Redirects cheers stuart -- ...let us be heard from red core to black sky On Wed, Jan 18, 2017 at 3:10 AM, Eric Lease Morgan wrote: > I wrote a little hack called the Synonomizer, a P

Re: [CODE4LIB] MARCXML help again

2017-01-10 Thread Stuart A. Yeates
That is, if the XML is completely consistent AND you're guaranteed to never encounter MARC data with XML special characters, then Kyle's suggestion is an excellent one. I really need to find an excuse to publish a document with a title starting " wrote: > Well, I think that's a *bit* harsh. But t

Re: [CODE4LIB] MARCXML help again

2017-01-10 Thread Stuart A. Yeates
That should have read: ... or more readibly as: cheers stuart -- ...let us be heard from red core to black sky -- Forwarded message -- From: Stuart A. Yeates Date: Wed, Jan 11, 2017 at 10:28 AM Subject: Re: [CODE4LIB] MARCXML help again To: Code for Libraries You need an

Re: [CODE4LIB] MARCXML help again

2017-01-10 Thread Stuart A. Yeates
You need an identity transform + a no-op template such as: or more readibly as: cheers stuart -- ...let us be heard from red core to black sky On Wed, Jan 11, 2017 at 10:13 AM, Julie Swierczek wrote: > Thanks to all who responded to my earlier plea for help. I now have a new > problem.

Re: [CODE4LIB] 46 gigabytes

2016-12-13 Thread Stuart A. Yeates
A question I'd start with is: Are you associated with an institution with a archive that you could deposit into in return for defraying their costs? cheers stuart -- ...let us be heard from red core to black sky On Wed, Dec 14, 2016 at 10:00 AM, Eric Lease Morgan wrote: > On Tue, Dec 13, 2016

[CODE4LIB] Fwd: Third party use of openurl resolver?

2016-11-30 Thread Stuart A. Yeates
I'm currently trying to systematise our use of EZproxy for non-purchased/licensed resources. There are two classes of non-purchased/licensed resources that I'm aware of. The first is the class of PURL-like: services http://doi.org/, http://purl.org/, http://handle.net/ and so forth which are used

Re: [CODE4LIB] OCLC's EZproxy stanzas collected anywhere?

2016-11-15 Thread Stuart A. Yeates
There is a mailing list at http://www.oclc.org/support/services/ezproxy/documentation/list.en.html which is a great source for info like this. cheers stuart -- ...let us be heard from red core to black sky On Wed, Nov 16, 2016 at 3:18 PM, Kate Deibel wrote: > Ranti Junus pointed me at this scr

Re: [CODE4LIB] PGP

2016-10-28 Thread Stuart A. Yeates
PGP has a dreadful reputation for usability, be prepared for a significant support burden if you take that route. You could always try omitting details from the email but providing a link: "You have 4 books due tomorrow, click here and login to see the details" kind of thing. That in conjunction w

Re: [CODE4LIB] University libraries supporting research databases/websites?

2016-09-03 Thread Stuart A. Yeates
"Database" covers a multitude of sins. Relational databases (MySQL / Oracle / postgres / etc) are unlikely to be things that most libraries / librarians view as within their core professional mandate. Textual databases (which may or may not have an associated relational database driving certain a

Re: [Code4Lib] Schema Validations at LOC

2016-07-31 Thread Stuart A. Yeates
The real alternative is to build your own XML catalogue. This also makes things faster by eliminating network lookups. For linux users there's the helpfully named 'xmlcatalog' command to build and maintain them. cheers stuart -- ...let us be heard from red core to black sky On Sun, Jul 31, 2016