Re: [CODE4LIB] accessing a python compressed sparse row format object

2017-09-27 Thread Eric Lease Morgan
On Sep 26, 2017, at 4:41 PM, Thomale, Jason  wrote:

 Does anybody here know how to access a Python compressed sparse row format 
 (CSR) object? [1]
 
 [1] CSR - http://bit.ly/2fPj42V
>>> 
>>> Do you have a link to the code you're using?
>> 
>> Yes, thank you. See —> 
>> http://dh.crc.nd.edu/sandbox/htrc-workset-browser/bin/topic-model.py  —ELM
> 
> I'm not familiar with the APIs in question, but--if I'm looking at this 
> right, your CSR matrix (tfidf) looks like it would have columns corresponding 
> with topics and rows corresponding with documents. If that's the case, you 
> could maybe do something like this:
> 
>   1. Use tfidf.getcol() to get the column corresponding
>  to your chosen topic. Looks like that should give you a
>  1-dimensional matrix of all document scores for that
>  topic.
> 
>   2. Cast that to an array of scores using .toarray(),
>  and then a list with .tolist(). (I think?)
> 
>   3. Use a list comprehension and "enumerate" to generate
>  explicit doc IDs based on each document's position in
>  the list, creating a list of 2-element lists or tuples,
>  (doc_id, score). While you're at it, you could filter
>  the list comprehension to give you only the documents
>  with scores that are greater than 0, or some other
>  threshold.
> 
>   4. Pass the results through the built-in "sorted"
>  function to sort your list of tuples based on score.
> 

> >>> topic = 9497
> >>> score_thresh = 0
> >>> topic_scores = tfidf.getcol(topic).toarray().tolist()
> >>> docs_and_scores = [(score[0], score[1]) for score in 
> >>> enumerate(topic_scores) if item[1] > score_thresh]
> >>> most_relevant_docs = sorted(docs_and_scores, key=lambda x: x[1])
> 
> The resulting "most_relevant_docs" variable should be a list of tuples that 
> looks something like this (for example):
> [(102, 0.9), (33, 0.875), (365, 0.874), ...]
> 
> Not sure if that's helpful...? There's probably a more numpy/scipy way of 
> doing the above using actual numpy array methods (especially the 4th line).


Jason, this is REALLY close, and I have begun to include it at the very end of 
my code. Thank you! ‘More later. code4lib++  —Eric Morgan


Re: [CODE4LIB] accessing a python compressed sparse row format object

2017-09-27 Thread Eric Lease Morgan
On Sep 26, 2017, at 3:53 PM, Andromeda Yelton  
wrote:

> These people might be doing something similar enough to your use case to be
> useful?
> 
> https://de.dariah.eu/tatom/topic_model_python.html#inspecting-the-nmf-fit


Yes, thank you. I’ve seen the particular article, above. It is very similar. 
—Eric M


[CODE4LIB] [EDUC]: October Classes from Amigos Library Services

2017-09-27 Thread Jodie Borgerding
Amigos Library Services has a great line up of online classes in October. Seats 
are filling up so don't delay in registering!

Know & Go: The Power of Presenting - October 2, 2017, 1:30 pm - 2:30 pm CDT
Speaking at conferences and other events is an opportunity to share ideas and 
get your name out among the library field. If you've never presented at a 
conference before, it can be intimidating to decide what to talk about, what 
format, and for what conference. In this session, we will talk about why public 
speaking is an important skill to master, different types of common 
presentation formats, and offer suggestions to help you choose the best format 
for your topic. In addition, we will talk about tips and strategies to help you 
become a better presenter. Register today at: https://www.amigos.org/node/4517.

RDA for Copy Catalogers - October 4 - 5, 2017, 10:30 am - 12:30 pm CDT
Copy Catalogers, whether or not your library has implemented RDA, you're 
encountering more and more RDA records in your daily work. This two-day, 
live-online workshop will help you to recognize and interpret MARC records 
created using RDA. Among the topics to be covered: FRBR review; new RDA MARC 
fields; differences between AACR2 and RDA descriptions; access points for 
persons, families, corporate bodies, works and expressions; relationship 
designators added to access points; examples of RDA bibliographic records. 
Register today at: https://www.amigos.org/node/3115.

Makerspaces in the Library: How to plan for success! - October 11, 2017, 10:00 
am - 12:00 pm CDT
Implementing a makerspace into your library's services can be a challenging and 
exciting project for public, special, school and academic librarians alike. 
This course will utilize feedback from other successful makerspaces, and look 
at how makerspaces can be designed to promote library services already in 
place. We will also be talking about what it means to be innovative and 
interdisciplinary with your library's programming. Library services should be 
reevaluated on a regular basis and interpreted liberally with planning your 
makerspace. Register today at: https://www.amigos.org/node/4645.

RDA for Video Recordings - October 11 - 12, 2017, 10:30 am - 12:30 pm CDT
This workshop covers the RDA instructions relevant to cataloging carrier-based 
and electronic video recordings. Among the topics to be covered: descriptive 
elements; choice of preferred source of information; access points, 
relationship designators. Register today at: https://www.amigos.org/node/3244.

The Usable Library - October 18 & 20, 2017, 10:00 am - 12:00 pm CDT
As librarians, one of our main goals is to connect people with the information 
that they need. While librarians play a very large part in this process, we 
must not forget the role our libraries themselves play in this process. Our 
libraries should be attractive, yes, but above all, they should be usable. In 
this course, we will identify key barriers to a library's usability. We will 
discuss possible solutions for improving the layout and signage of a library, 
optimizing its usability. Finally, we will outline how to conduct a usability 
test in your library. Register today at: https://www.amigos.org/node/4044.

Teaching Beginner Computer Classes: Seniors and Beyond - October 23 -24, 2017, 
1:00 pm - 3:00 pm CDT
Developing basic computer skills is a challenge that college students and 
seniors alike can face. Libraries increasingly offer beginning computer classes 
in both the public and academic fields in an attempt to bridge the gap between 
people and information. In this class, we will identify the applications, tools 
and skills computer beginners need, distinguishing the needs of seniors from 
those of younger learners. We will discuss the disposition and patience needed 
by computer class instructors, and finally, we will develop sample curricula 
for a computer class in your library. Register today at: 
https://www.amigos.org/node/3671.

Dealing with Difficult Patrons - October 24 & 26, 2017, 2:00 pm - 4:00 pm CDT
Everyone working in a library will at one point have an interaction with a 
difficult patron. Whether the patron is frustrated with library policy, refuses 
to follow standard rules, or is simply argumentative or persistent, librarians 
must develop interpersonal skills in order to deal with these patrons. In this 
course, we will apply standard customer service skills when relating to 
difficult patrons. We will also list common patron difficulties as well as ways 
to effectively solve problems. Register today at: 
https://www.amigos.org/node/2008.

Know & Go: Ransomware - October 30, 2017, 1:30 pm - 2:30 pm CDT
Wanna Cry, a 2017 computer attack, is an example of ransomware. The attackers 
demanded $300 for every computer it infected. If the money was not paid within 
7 days, all files on that computer were deleted. But what is ransomware and how 
does it work? More importantly, is there anything we 

[CODE4LIB] Fwd: Community Meeting on Open Source Discovery Layers at the DLF Forum

2017-09-27 Thread Stephen Meyer
Forwarding on behalf of Shelley.

At UW-Madison we have had very productive conversations with Ex Libris 
development staff at other conferences to work through how our locally 
developed discovery interface works with their Alma publishing processes and 
APIs. They are engaged with the community and folks using Alma who may be 
considering discovery interfaces like Blacklight or VUFind should give this 
consideration.

-sm

Begin forwarded message:

From: Shelley Hostetler 
Subject: FW: Community Meeting on Open Source Discovery Layers at the DLF Forum
Date: September 20, 2017 at 11:54:11 AM CDT
To: "stephen.me...@wisc.edu" 

Join us for A Community Meeting at the Digital Library Federation Forum
 
Wednesday, October 25th | 1:00 - 3:00 PM Eastern

Westin Convention Center | Room: Pennsylvania East

Please join us for a collaborative discussion during the upcoming DLF Forum on 
October 25th. This meeting is a continuation of our discussions with the 
community at both Digital Library Federation Forum 2016 and Code4Lib 2017, open 
for anyone interested in using open source discovery layers with Ex Libris 
products. University of Pennsylvania will discuss their experience implementing 
Blacklight over Alma with the Summon central index. Ex Libris will also share 
progress on enhancing Alma and Summon APIs to facilitate easier and more robust 
interactions with open source discovery solutions. We encourage you to come, 
learn and share both your thoughts and your questions regarding using open 
source discovery (such as Blacklight or VUFind) with Primo, Summon and Alma.

Please email customersucc...@exlibrisgroup.com with your full name and 
institution to register for the event.


[CODE4LIB] Job: Fellowship in the Digital Humanities at Villa I Tatti / Harvard University

2017-09-27 Thread Code4Lib Jobs
This fellowship aims to support the work of scholars in the humanities or 
social sciences, librarians, archivists, and data science professionals whose 
research interests or practice cut across traditional disciplinary boundaries 
and actively employ technology in their work. Projects can address any aspect 
of the Italian Renaissance, broadly understood historically to include the 
period from the 14th to the 17th century, and geographically to include 
transnational dialogues between Italy and other cultures (e.g. Latin American, 
Mediterranean, African, Asian etc.). Projects should apply digital technologies 
such as mapping, textual analysis, visualization, or the semantic web to topics 
in fields such as art and architecture, history, literature, material culture, 
music, philosophy, religion, and history of science.

I Tatti offers Fellows the precious time they need to pursue their studies with 
a minimum of obligations and interruptions together with a maximum of scholarly 
resources—a combination that distinguishes the Harvard Center from similar 
institutions. In order to foster a collaborative spirit, Fellows are expected 
to live in the Florence area and to spend at least three days a week at the 
center. Lunch and tea are served each weekday, and the I Tatti community takes 
shape over these convivial occasions. Each year, a limited number of activities 
organized at I Tatti are reserved for the Fellows, and they join the wider 
community at conferences, lectures, and concerts.

This is a residential fellowship of 4 or 6 months in length that will provide 
the time, space, and resources necessary to discover, explore, and build. Up to 
two fellowships will be awarded every year.



Brought to you by code4lib jobs: 
https://jobs.code4lib.org/jobs/27841-fellowship-in-the-digital-humanities


[CODE4LIB] Job: Director of Library Information Technology at California Polytechnic State University

2017-09-27 Thread Code4Lib Jobs
CAL POLY IS CALLING!  SEEKING A DIRECTOR OF LIBRARY INFORMATION TECHNOLOGY

Overview of the Role
As a member of the Library Management Team, the Library Information Technology 
Director plays a strategic role in leadership, planning, management, and 
implementation of innovative and responsive library systems, applications, and 
technology services that support the teaching, learning, and research 
activities of Cal Poly. 

·   Reports Directly to the Dean of Library Services

·   Plays a strategic role in leadership, planning, management, and 
implementation of innovative and responsive library systems, applications, and 
technology services that support the teaching, learning, and research 
activities of Cal Poly

·   Oversees the administration and maintenance of existing systems

·   Guides and supports the selection, adoption, and technical integration 
of emerging library technologies, automated systems, and digital initiatives 

·   Responsible and accountable for providing technology leadership, 
developing and managing a staff of IT professionals and student assistants

About Library Information Technology 
 

6 Direct Reports to the Director of Library IT

Workstation Specialist
Lead Applications Programmer and Integrator
Digital Media Specialist
Designer for Web, Graphics and UX
Library Applications Specialist
Server Specialist

A Successful Candidate Will Have Had Accomplishments In The Following:

Innovative, forward thinking and collaborative team player
Embracing a dynamic and forward looking library management team
Open, transparent, direct, and collaborative communication style
Leadership skills in developing and managing IT professionals and student 
assistants
People management skills especially holding people accountable for their work
Experience managing a $300k - $500k budget

Minimum Qualifications, Education And Experience:

Bachelor’s degree
Minimum of 5 years of professional experience in an academic or research 
library or related technology field, including at least five years of 
progressively responsible management and supervisory experience

The entire job posting is available @   
www.calpolyjobs.org/applicants/Central?quickFind=166379      

About Robert E. Kennedy Library 
Integrating traditional library resources and services with digital tools, the 
Robert E. Kennedy Library continuously adapts its services, programs, and 
spaces to meet the evolving academic programs, research interests, and needs of 
the university community. Providing access to information resources, 
classrooms, computer labs, an expansive 24-hour space, and an array of group 
and individual study spaces, Kennedy Library is a magnet for students and 
faculty alike. Annual visitors exceed 1.4 million. Kennedy Library's innovative 
peer-to-peer learning and reference program simultaneously leverages the 
benefits of student-to-student learning and Cal Poly's signature pedagogical 
ethos of Learn by Doing. There is an active Student Library Advisory Council 
and in 2014 the Kennedy Library received the prestigious ACRL Excellence in 
Academic Libraries award. The library partners with campus organizations to 
promote faculty development and student success, including several that are 
located in the library: the Center for Teaching, Learning and Technology; the 
Academic Skills Center; and the Honors Program. In addition, the University 
Writing and Rhetoric Center will soon find a new home in the library. In 
testimony to the perceived value of the Robert E. Kennedy Library across the 
entire campus, various stakeholders and the library are in the process of 
strategizing, planning and launching a fully funded renovation, which will 
increase the library's ability to provide services and spaces to further 
support student and faculty success. 

The Benefits
As part of the CSU system, Cal Poly offers a very generous benefits package, 
including the following:

Enrollment in CalPERS (Public Employee Retirement System)
Health, dental, and vision insurance at competitive rates or no cost to employee
Program to allow employee or an eligible family member to attend classes at any 
of the CSUs for reduced fees
More information is available on the Cal Poly benefits page 
https://afd.calpoly.edu/hr/benefits/


U.S. News Names Cal Poly Best in the West for 24th Consecutive Year

Cal Poly is ranked as the No. 1 public, master’s-level university in the West 
by U.S. News & World Report’s America’s Best Colleges guidebook again this 
year – a ranking awarded to Cal Poly every year since 1993.

The 2017 guidebook lists Cal Poly in a tie with Saint Mary’s College for ninth 
place overall in the West of regional universities, public and private. This is 
up from 10th place last year.

“Our continued inclusion in the prestigious U.S. News rankings is a testament 
to the Cal Poly community’s unwavering passion for providing the best possible 
Learn by Doing experience for all of our students,” said C

[CODE4LIB] New project: dat in the lab

2017-09-27 Thread John Chodacki
Quick announcement:

Gordon and Betty Moore Foundation has awarded a 
research grant to the California Digital Library and 
Code for Science & Society (CSS) for the 
Dat-in-the-Lab project to develop practical new techniques for effective data 
management in the academic research environment.

The project will pilot the use of CSS’s 
Dat system to streamline data 
preservation, publication, sharing, and reuse in two UC research laboratories: 
the Evolution: Ecology, Environment lab at UC 
Merced, focused on basic ecological and evolutionary 
research under the direction of Michael 
Dawson; and the Center for 
Watershed Sciences at UC Davis, dedicated to 
the interdisciplinary study of water challenges.  UC researchers are 
increasingly faced with demands for proactive and sustainable management of 
their research data with respect to funder mandates, publication requirements, 
institutional policies, and evolving norms of scholarly best practice.  With 
the support of the UC Davis and UC 
Merced Libraries, the project team will conduct a 
series of site visits to the two UC labs in order to create, deploy, evaluate, 
and refactor Dat-based data management solutions built for real-world data 
collection and management contexts, along with outreach and training materials 
that can be repurposed for wider UC or non-UC use.

More information on the project here: 
http://uc3.cdlib.org/2017/09/27/moore-foundation-supports-uc3-research-data-management-project/
Feel free to RT: https://twitter.com/UC3CDL/status/913189766618681344

In addition, you can bookmark 
Dat-in-the-Lab
 on GitHub for access to code, curricula, and other project outputs.  Also 
follow along as the project evolves on our 
roadmap,
 chat with the project team, and keep up to date 
through the project Twitter feed.  For more 
information about UC3, contact us at u...@ucop.edu and 
follow us on Twitter.



[CODE4LIB] Seeking Hosts for Fellowships in Data Curation for Latin American & Caribbean Studies

2017-09-27 Thread Alice Bishop
The Council on Library and Information Resources (CLIR)
 is currently seeking host institutions for 2-year
fellowships in data curation for Latin American and Caribbean Studies.

These two-year fellowships, supported by a grant from The Andrew W. Mellon
Foundation, will provide five recent Ph.D.s with professional development,
education, and training opportunities in data curation for the humanities
or related social sciences focused on Latin American and Caribbean Studies.

Fellowships will begin between June and September, 2018. Fellows must have
completed the Ph.D. within the five years preceding the fellowship
appointment. Fellows' research expertise must be in an area that relates to
Latin American and Caribbean Studies.  Fellows' salaries and educational
benefits are fully funded through CLIR for selected hosts.

For the 2018-2020 cohort, an advisory panel of experts will select five
institutions with nationally recognized strengths in Latin American and
Caribbean Studies, the management of large-scale digital projects, and data
curation.

*Preference will be given to those institutions that exhibit strong,
existing partnerships with institutions or organizations in Latin America
and/or the Caribbean and have not previously hosted a CLIR/DLF fellow in
this area*. Prospective hosts should submit *a fellowship proposal* fellowship
proposal 
by *5:00PM
EST October 18, 2017*.

For additional information please visit https://www.clir.org/fel
lowships/postdoc/hosts/fellowships-in-data-curation-for-lati
n-american-and-caribbean-studies or contact Alice Bishop, Senior Program
Officer, at abis...@clir.or or by phone at (202) 939-4763.