15:00:32 #startmeeting 2016-03-15 - Evergreen focus group discussion on search 15:00:32 Meeting started Tue Mar 15 15:00:32 2016 US/Eastern. The chair is kmlussier. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:32 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:32 The meeting name has been set to '2016_03_15___evergreen_focus_group_discussion_on_search' 15:00:49 Huh, just realized it's March 15. Beware the Ides of March! 15:01:00 #topic Introductions 15:01:09 Please introduce yourselves, as follows 15:01:16 #info kmlussier is Kathy Lussier, MassLNC 15:01:24 sherbertbc is Sharon Herbert, Sitka/BC Libraries Cooperative 15:01:31 #info ethomsen is Elizabeth Thomsen, NOBLE 15:01:49 #info rhamby is Rogan Hamby, Equinox Software 15:01:52 #info pams Pamela Smith howe library, hanover, nh 15:02:09 #info csharp is Chris Sharp, GPLS (partially here) 15:02:55 While we wait for more people to come in, I'm posting a link to some ground rules I wrote up. 15:03:03 http://wiki.evergreen-ils.org/doku.p​hp?id=scratchpad:search:focus_groups 15:03:09 #info yboston - Yamil Suárez - Berklee College of Music 15:03:10 bah! That's not going to work 15:03:18 #info onijay is Joni Paxton, Community Library (lurking & learning) 15:03:44 #info sherbertbc is Sharon Herbert, Sitka/BC Libraries Cooperative 15:03:59 http://wiki.evergreen-ils.org/doku.php?id=scratchpad:search:focus_groups 15:04:23 asically, this chat is an opportunity to talk about ideas for search. Although it's tempting, especially for technical folks, to get in the nitty gritty of how to make those ideas happen, we don't want to get bogged down in the details today. 15:04:32 Sorry, that should have said basically. 15:04:50 We really want to focus on end results today. 15:05:05 Also, as is the case with any brainstorming, there are no stupid ideas. Feel free to share whatever ideas you have. And please don't be critical of others ideas. 15:05:18 However, I do encourage you to build upon each other's ideas. 15:05:33 kmlussier: +1 on ground rules 15:05:34 #info dbwells is Dan Wells, Hekman Library (Calvin College) 15:05:52 I'm going to get started on the discussion now. If you come in late, feel free to introduce yourselves in the middle of the discussion. 15:06:08 #topic Strengths of current search 15:06:26 I want to start by asking people to identify what they like about the current Evergreen search. 15:06:37 What are its strengths? 15:06:44 Is there anything that Evergreen search has that makes it unique (in a good way)? 15:07:55 #info miker == Mike Rylander, Equinox 15:08:44 The reason I ask this question is because, if we make major changes to search, we want to make sure we don't lose the things that are already working well for us. 15:09:08 So think of the question in terms of things you don't want to lose. 15:09:32 separate scope and depth search ranging 15:09:47 Like I have said before, I like that it can handle diacritics, and we can filter by format type 15:09:59 Excellent! Thanks miker! 15:10:17 save to list, filter by availability 15:10:20 configurable visibility for URIs (with an outstanding patch to improve that) 15:10:46 sherbertbc: Filtering by availability is a good. Thanks! I don't think it's come up in our past conversations. 15:10:50 sherbertbc: and, more broadly, filter by status (which is how #available works under the covers) 15:10:53 miker: Are you speaking of Located URI's? 15:11:01 kmlussier: I am :) 15:11:05 our users (staff and patron) appear to want the ability to have a non-keyword search that just sorts by format, location, and date (e.g. "give me the newest DVDs for my library") 15:11:34 oops - may not be the time for that 15:11:42 csharp: We'll get there in a bit. 15:11:43 csharp: that's expressible today, but not with UI widgets. fwiw 15:12:10 Any other key features we need to ensure that we preserve in our search? 15:12:45 I know there are probably a lot of things we like, but it's not always easy to think of them because we just get used to having certain things. 15:13:56 we like the call # numeric/browse shelf option 15:14:21 pams: Great! Thanks for mentioning it. 15:14:31 We like limit to copy location in Adv. Search 15:15:26 We like to do ISBN searches 15:15:46 Yes, in addition to the copy location limiting, I would say there are a lot of power limiters from advanced search. 15:15:55 I like using search by barcode on occasion 15:16:06 also call number search 15:16:26 Onijay: When you say ISBN searches, is that from the Numeric search or is it that you like the ability to do it from a keyword search? 15:16:47 both 15:16:59 Onijay: Gotcha. Thanks! 15:17:20 I'm going to move on to the next topic, but if anyone wants to add other strengths in the course of the conversation, please feel free to do so. 15:17:25 #topic Areas for improvement 15:17:38 Where are the areas where you would like to see improvement? 15:17:46 Speed and relevance 15:18:32 spelling assist like Google, did you mean? 15:19:04 ethomsen: Thanks! I don't think there is much to say about speed, except make it faster, but could you talk a little more about where you would like to see improvements in relevance? 15:19:15 Basically, where do you see relevance coming up short? 15:19:32 pams: Yes, Did you Mean would be helpful. 15:19:51 Are there other areas where we think the catalog could provide more assistance to the users in helping them find what they are looking for? 15:19:59 that's probably the biggest problem our patrons have with our catalog 15:20:00 Maybe it's just our configuration, but we find very short, older records consistently coming to the top of the results 15:20:31 ethomsen: Thanks! 15:21:27 pams: The biggest problem being that the search terms are entered incorrectly, and they need help getting to the right thing? 15:21:37 Our staff finds it difficult to understand (and therefore explain) why certain records are coming up on top -- can't see what's bumping certain records up so high 15:21:52 * miker has thoughts on that ... but they're implementation-related and will hold off (just sticking a pin in it for later) 15:22:17 kmlussier: right, a typo or incorrect spelling returns 0 results 15:22:32 * kmlussier likes it when miker has thoughts, but thanks for holding off on details. :) 15:23:03 (likewise with misspellings... for later :) ) 15:23:11 Or alternate spelling -- we want to maintain local synonym list with matched words and phrases 15:23:11 color=colour, rain forest=rainforest, 1920s=nineteen twenties, etc. 15:23:11 (Would like to show users which words have expanded their search and let them opt out of any word) 15:24:16 ethomsen: That's interesting. You mentioned not being able to see what's bumping certain records. It sounds like you're saying that it's not only that the returned records should be more relevant, but that there's a value to seeing why records are returned in a certain way? 15:24:22 Am I understanding that correctly? 15:25:07 At least for staff in troubleshooting mode (or mood) 15:25:59 jobs 15:26:07 sorry :) 15:26:11 ethomsen: If they were happy with the results, do you think seeing why they're returned in a certain way would be as important? 15:27:00 OK, so I'm seeing speed, relevance improvements, Did You Mean, a synonym list/thesaurus. 15:27:06 apostrophes also pose a problem 15:27:09 It's not really library staff, it's our staff (consortium) trying to understand and to change configs or fix records 15:27:18 ethomsen: OK, gotcha. Thanks! 15:27:21 filtering by format on the search results page could be more obvious, now buried in faceting & labelled as 'genre" not intuitive 15:27:47 Sometimes overlaying records with better ones demotes them in search results! 15:27:49 sherbertbc: Are you saying you would like a format facet? 15:28:22 sherbertbc: or, do you want a "controlled" set for filtering on genre/format? 15:28:24 ethomsen: I'm going to expand upon that a bit and say that record quality should matter in relevance? 15:28:31 We want both physical format and genre facets 15:28:53 sherbertbc: Oh, I think I misunderstood what you were saying. I understand now. 15:29:03 more than just re-labelling genre to facet. I think bringing "Filter results by" to the top of sidebar would help 15:29:17 miker: not sure if ^^ is answering your question though 15:30:00 kmlussier: Not sure if that true, just think short records getting too much attention! To me, if a short and long record both have the word Puffins in the title and subject heading, they should rank the same. 15:30:04 sherbertbc: to put it in "evergreen features" terms, akin to, say, the language or format filter on the advanced search page? 15:30:18 sherbertbc: It sounds like you're saying you would like to see usability improvements to limiting on the search results screen. 15:30:27 kmlussier: +1 15:30:50 yes, the functionality is there, it just isn't very evident to users re: terminology and placement 15:30:51 I have more to say on that, but will hold back for now. :) 15:31:07 ethomsen: Thanks 15:31:15 We definitely want more facets so a user can filter results rather than re-executing search 15:31:52 I want to go back to something ethomsen said earlier: "Would like to show users which words have expanded their search and let them opt out of any word" 15:32:16 In this case, it sounds like you're talking about when the system searches for stemmed terms - is that right? 15:32:42 pams: Also, I would like to hear more from you on where you see problems with apostrophes. 15:33:46 we have a lot of problems where an author or title isn't searchable without the apostrophe 15:33:46 Not necessarily stemming, but any future situation where the system is using synonym lists or other tools to be able to change search 15:34:49 pams: OK, I'll not it down. I know we have some issues with apostrophes too, but I also have some info I need to add to documentation that may be able to help you currently. 15:35:08 pams: it sounds like more indexing normalizers would be helpful to you ("remove apos", etc) 15:35:21 pams: does that sound correct? 15:36:19 The possible feature ethomsen mentioned reminded me of comments made in a previous chat to improve fuzziness. I think the crux of the discussion was that we wanted Evergreen to be smarter with its fuzziness. 15:36:41 How do you all find the current use of stemming in Evergreen search? 15:37:03 We have it turned off. 15:37:07 death to stemming! 15:37:16 kmlussier: yes that is correct 15:37:38 BTW, I sometimes get errors searching terms that include non-alphanumeric characters like "!@#$%^& 15:37:44 er, what i meant to say was: i'm ambivalent about stemming! 15:37:51 jeff: I read that as "knobs for adjusting stemming!" ;) 15:38:00 jeff: Yes, that sounded ambivalent. 15:38:32 I hear smarter fuzziness, whether it's stemming or another way to search word variations. :) 15:39:03 Any other areas of improvements we want to bring up? Or features we see in other search systems that we would love to see in Evergreen? 15:39:05 We like controlled fuzziness -- lists we maintain and can adjust 15:39:38 Or maybe new and innovative things that nobody else has done yet? 15:39:54 ethomsen: I'll paste a link of interest after the meeting if you can stick around for a minute 15:40:02 Thanks! 15:40:48 miker: I'm still trying to come up with a good analogue or metaphor for tuning search knbos. the best i've come up with is this: https://www.amazon.com/dp/B004NCEL4M/ 15:40:48 kmlussier: I'd like to see search augmented with external, curated disambiguation information 15:41:19 jeff: heh... I was expecting a sound board with hundreds of knobs ;) 15:41:49 miker: authority records, linked data...? 15:42:05 "you searched for steven king, who has a few children's books. you might have actually wanted to search for stephen king, a prolific author well known for ..." 15:42:14 ethomsen: and other sources, like wikidata related/broader/narrower lists 15:42:19 miker: I like the sounds of this. Can you tell us a bit about what the end user would see? 15:42:35 Or maybe jeff helped to answer my question. 15:42:55 that's a use case, yep 15:43:19 more that "did you mean" ... "other things that might be related" 15:44:05 miker: me, too. So you search Fenway Park and it knows your search is probably related to Boston, Boston Red Sox, baseball, baseball parks... 15:44:27 ethomsen: right, like that 15:45:14 OK, I'm going to wrap up this topic now to move on to a bit more detail on relevance. But I think this one ended on a high note. 15:45:17 I love Wikipedia's disambiguation pages, so much better for public than authority displays! https://en.wikipedia.org/wiki/Mars_%28disambiguation%29 15:45:52 #topic Defining relevance 15:45:55 kmlussier: another enhancement, improve metarecord grouping (in many ways, manual and otherwise), and make them first-class, useful things 15:46:08 I really hated interrupting the disambiguation discussion for this. 15:46:09 miker: YES! 15:46:37 miker: OK, we'll digress for a second. Tell me about the first-class, useful things we'll be turning the into. 15:48:07 :) sorry ... so, with better grouping mechanisms (more tunable automated grouping, possible to have multiple MRs per record, cataloger-adjustable MR inclusion) I think some classes of searches would benefit from having MRs be the main result list 15:48:30 and, that result list could (should, IMO) surface information about the constituent records right up front 15:48:56 have the "work" level info, then "manifestation" info below that for each constituent 15:49:31 it would be a larger result page, but get the user to the thing they wanted faster ... 15:49:42 And is this somewhat related to the code4lib talk you saw? Because I need a picture to help me with see this. 15:49:45 not applicable to all search needs, certainly 15:50:03 kmlussier: it is. I haven't had a chance to find the timecode for the start of the talk yet, though 15:50:10 OK, thanks 15:50:12 but, the videos for thursday are up now 15:50:13 How about being able to be looking at a single record and being able to jump from there to other formats/editions, like in Amazon? 15:50:20 Now I want to move on to relevance. 15:50:20 ethomsen: YES! 15:50:24 :) 15:50:42 kmlussier: sorry for derailing 15:50:53 No, that's okay, That's what this discussion is for. :) 15:51:39 I often hear people say that relevance needs to be improved. But relevance for one person may be not-so-relevant for another. I think we really need to define what relevance means for us. 15:52:00 My question is, what factors should be used to identify a record as being highly relevant in a search? 15:52:29 We heard a bit of this above. We heard that we don't want them to be older and shorter records. And I think ethomsen mentioned words appearing in specific fields. 15:52:38 What else should contribute to relevance? 15:53:18 Obvious aboutness -- searched terms appearing near each other in author, subject, title fields coming up first 15:53:35 We have great hopes for activity metric, too. 15:54:17 I'm hearing three things here: Words in specific fields, word proximity, and popularity. 15:54:40 Other than the shorter / older records, are there other places where we consistently see Evergreen not returning relevance records? 15:55:53 I would have added more recent titles should tend to appear higher, but I think activity metric will tend to have that effect anyway 15:56:35 Well, I think recency should be identified anyway. Just to make sure it's not overlooked. 15:57:09 I'm going to leave this question out for about 30 seconds longer before I move on to the final question. 15:57:50 #topic Top-priority improvements 15:58:04 If you had just two improvements you could make to Evergreen search, what would they be? 15:58:07 We love summary notes and contents notes, but they have so many words in them they often overwhelm the search results with titles of low relevance. I'd like to be able to be able to turn "Search within contents and summary notes?" on or off. 15:59:21 ethomsen: Thanks for adding that idea. That particular idea hasn't been raised yet, but it correlates with something I was thinking recently. 15:59:22 Are we allowed to say speed and relevance? 15:59:32 ethomsen: Yes, if those are the two highest priorities. 15:59:49 Speed and relevance 16:00:13 For the first chat, I only let people pick one. Most people picked speed, so JBoyer suggested I expand to two improvements. 16:00:43 Anyone else have a top 2? I know there were more people sharing ideas earlier. :) 16:01:45 I would say relevance & spelling 16:01:51 Thanks pams! 16:02:35 We're pass the two minute mark, so I'm going to wrap up. But if anyone else wants to send me there two, just let me know. I'll add it to my notes even if it doesn't make the official logs. 16:02:41 #endmeeting