15:01:54 <kmlussier> #startmeeting 2016-03-16 - Evergreen focus group discussion on search 15:01:54 <pinesol_green> Meeting started Wed Mar 16 15:01:54 2016 US/Eastern. The chair is kmlussier. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:01:54 <pinesol_green> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:01:54 <pinesol_green> The meeting name has been set to '2016_03_16___evergreen_focus_group_discussion_on_search' 15:02:17 <kmlussier> #topic Introductions 15:02:30 <kmlussier> Please introduce yourselves, as follows 15:02:36 <kmlussier> #info kmlussier is Kathy Lussier, MassLNC 15:02:49 <miker> #info miker == Mike Rylander, Equinox (in and out) 15:02:57 <afterl> #info afterl is Amy Terlaga, Bibliomation 15:03:49 <sarahchilds> #info sarahchilds is Sarah Childs (surprise), Hussey-Mayfield PL, Evergreen Indiana 15:03:49 <kmlussier> While we wait for more people to join us, I'll post a link to our page where we have 1) ground rules for the discussion and 2) logs from our previous chats. 15:04:03 <kmlussier> #link http://wiki.evergreen-ils.org/doku.php?id=scratchpad:search:focus_groups 15:04:22 <rhamby> #info rhamby is Rogan Hamby, Equinox Software (mostly lurking) 15:04:51 <kmlussier> Basically, this chat is an opportunity to talk about ideas for search. Although it's tempting, especially for technical folks, to get in the nitty gritty of how to make those ideas happen, we don't want to get bogged down in the details today. 15:05:07 <kmlussier> We really want to focus on end results today. 15:05:21 <kmlussier> Also, as is the case with any brainstorming, there are no stupid ideas. Feel free to share whatever ideas you have. And please don't be critical of others ideas. 15:05:29 <kmlussier> However, I do encourage you to build upon each other's ideas. 15:06:01 <kmlussier> I think we have a smaller turnout today than we have on other days, so I hope you all feel talkative! :) 15:06:46 <kmlussier> I'm going to start our first topic, but if anyone comes in late, please feel free to introduce yourselves and join in. 15:07:20 <kmlussier> #topic Strengths of current search 15:07:35 <kmlussier> I want to start by asking people to identify what they like about the current Evergreen search. 15:07:45 <kmlussier> What are its strengths? 15:07:48 <dbs> @info dbs is Dan Scott, Laurentian University 15:07:48 <pinesol_green> dbs: (info <url|feed>) -- Returns information from the given RSS feed, namely the title, URL, description, and last update date, if available. 15:07:52 <dbs> #info dbs is Dan Scott, Laurentian University 15:09:47 <dbs> Keyword searches work as a great default for us for the vast majority of searches 15:09:53 <afterl> Search filters work relatively well 15:10:15 <sarahchilds> Filters make good use of MARC data 15:10:25 <kmlussier> dbs: And when you say they work well, is that because there is good coverage of the record? 15:10:43 <dbs> kmlussier: seems to hit the most important bases, yep 15:10:48 <kmlussier> OK, good. 15:11:15 <kmlussier> And we like the advanced search filters and the fact they make use of the MARC data. 15:11:19 <kmlussier> The reason I ask this question is because, if we make major changes to search, we want to make sure we don't lose the things that are already working well for us. 15:11:32 <kmlussier> So think of the question in terms of things you don't want to lose. 15:11:37 <miker> extensibility of search/filter configuration ... though "yay, lots of knobs!" / "boo, lots of knobs :(" 15:12:24 <kmlussier> So there are pros and cons to the amount of configuration we have in search. 15:12:52 <kmlussier> Is there anything Evergreen search has that you think makes it unique? 15:13:45 * miker wonders if he should cheat and pull from yesterday's log :) 15:13:52 <sarahchilds> It seems pretty rare to use fixed fields so extensively for filtering 15:14:24 <kmlussier> miker: No cheating allowed. 15:15:12 <kmlussier> Sorry, I purposely allow for quiet moments because it makes people think harder. 15:15:26 <miker> :) 15:15:36 <kmlussier> But I'll move on to the next topic. If you think of anything else while we're talking, feel free to shout it out. 15:15:38 <afterl> kmlussier: so looks like we'll be reading yesterday's log on our own .... homework 15:15:46 <miker> heh 15:16:02 <kmlussier> afterl: I'll be compiling the results at some point if you want the Cliff Notes version. 15:16:19 <kmlussier> #topic Areas for improvement 15:16:22 <afterl> kmlussier: yes, thanks. Cliff Notes work 15:16:29 <kmlussier> Where are the areas where you would like to see improvement? 15:17:43 <sarahchilds> I'd like to see search provide suggestions based on authority records 15:18:01 <Bmagic> #info Bmagic = Blake GH, MOBIUS 15:18:06 <kmlussier> sarahchilds: OK, can you expand upon that? Is this in keyword searching? 15:18:19 <kmlussier> Hi Bmagic! Welcome! 15:18:58 <afterl> I'd like a true relevance/popularity blended search. And yes, that may force kmlussier to explain what's coming down the road, but maybe that's not allowed here (not the time and place)? 15:19:10 <sarahchilds> Maybe. If you do a search for doctors, you could get a Did you mean: Physicians 15:19:39 <kmlussier> sarahchilds: Ah, I see. So it's a "Did you mean?" but leveraging authority data. Gotcha. 15:19:59 <sarahchilds> If you do a search for Samuel Clemens, it would suggest Mark Twain 15:20:15 <dbs> I'd like to see grouping of search results based on external links to, say, OCLC Work Entities 15:20:32 <kmlussier> To follow up on what afterl said, we do have activity metrics coming to search that afterl and I have been testing. But the blending of popularity with relevance is tricky. And we're still working on it / testing it. 15:21:42 <kmlussier> dbs: Can you give me an example of a search that would use that and how the info would be presented to the user? 15:24:18 <dbs> kmlussier: sure. A regular search for "Huckleberry Finn" would give you all of the hits for "Adventures of Huckleberry Finn" based on keywords 15:24:55 <dbs> but using links to OCLC Work Entities, you could group those records even if they were in different languages and the words appeared nowhere in the record 15:25:10 <kmlussier> Ah, nice! Thanks for the explanation dbs! 15:26:14 <kmlussier> Are there features you've seen in other search systems that you would like to see implemented in Evergreen? Things that haven't been mentioned yet. 15:28:04 <dbs> I wish we could offer a single search across not just books but blended with articles, items in our institutional repository, etc, but that's probably out of scope for Evergreen 15:29:09 <kmlussier> dbs: Well, for the purposes of this discussion, I don't think we should say anything is out of scope for the project, because if it's turns out to be something a lot of Evergreen users want to see, then maybe it becomes part of the scope at some point. 15:29:34 <kmlussier> dbs: But I also think a search like that needs to be handled well so that the user doesn't get lost in results. 15:30:12 <JBoyer> dbs: that could be part of a project making Evergreen easier to connect to a discovery system, which would probably touch on search. :) 15:30:33 <kmlussier> Are there other parts of search that just drive your users crazy? Or would you say that, overall, they are fairly happy with the existing search? 15:30:45 <kmlussier> JBoyer: That's a good point. 15:30:50 <kmlussier> Hi terran! 15:31:17 <terran> So sorry I'm late, it's my first day back in the office and I'm pulling my hair out! 15:31:45 <kmlussier> terran: We're just talking about ways we would like to improve search. Do you have any improvements you would like to see? 15:32:11 <afterl> This was talked about awhile back (we were going to do development but it never panned out), and Evergreen doesn't do this now, I don't think, and I will be embarrassed if it can now, but having the search terms highlighted in the results (on the display record) would be nice. 15:32:14 <terran> The main complaint we get is search speed. 15:32:33 <rhamby> overall my experience with users is that they are pretty happy with existing search and keyword search with the one major qualifier to that being that they want records to float the top based on activity (which is in progress) 15:32:50 <rhamby> with that said, many of these other ideas that have come out of these I'm sure would be welcome as well 15:32:52 <kmlussier> terran: Thanks terran! 15:32:54 <sarahchilds> Users hate when they do a search for a particular title and the item they are looking for shows up low in the results 15:33:46 <terran> Most of our users would also like the most popular records to float nearer the top on keyword or title searches. 15:33:55 <kmlussier> sarahchilds: So improvements in relevance. Do you find there are specific problems in relevance that cause those records to appear low in the list? 15:34:54 <terran> And of course, a good "did you mean...?" service to accommodate mis-spellings and related topics 15:35:06 <sarahchilds> One of the issues I get complaints about is that our users expect to be weighed more heavily 15:35:19 <kmlussier> terran / rhamby: On the popularity front, afterl had brought this up earlier, but there is an ongoing project as rhamby noted, but, from early testing, we're also looking at ways to better blend bib relevance with popularity. Which is tricky. 15:35:30 <sarahchilds> expect *title* to be weighed more heavily 15:35:49 <kmlussier> sarahchilds: Thanks! Then field-weighting is important. 15:36:13 <terran> Yes, I strongly agree with weighing title (and primary author) more heavily 15:36:29 <kmlussier> I know there is some ability to configure field-weighting in Evergreen, but I think implementation might not be consistent across the board. As my follow up on these chats, I hope to identify where more documentation may be needed too. 15:36:46 <rhamby> it is and relevance is going to take on different aspects to different audiences. a public library that heavily circulates new best sellers is going to be well served by new active titles floating the top. An academic library I would see as very different in how they want to determine relevance. 15:37:29 <sarahchilds> rhamby: agreed 15:38:51 <kmlussier> Things I'm seeing in areas for improvement, then, are "Did you Mean?", leveraging authority data in keyword searches, blending popularity with relevance, grouping of search results based on external links, improved speed, and improved relevance. 15:39:14 <kmlussier> Did I miss anything? 15:40:07 <kmlussier> If not, I'm going to move on to the next topic, which is something we were just discussion. 15:40:15 <kmlussier> #topic Defining relevance 15:40:45 <kmlussier> I often hear that people want to improve relevance, but, as rhamby just mentioned, what is relevant for one person may not be for another. 15:40:55 <terran> <admits to getting completely lost trying to understand field weighting in the current system> 15:41:03 <kmlussier> So I think it's important that we define what we mean by relevance. 15:41:29 <kmlussier> terran: That's a good point. So I wonder if there is a suggested improvement in there for making it easier / more understandable to turn the knobs? 15:41:52 <afterl> I would support that improvement 15:42:20 <kmlussier> What are the factors that you think are important for the system to consider when calculating relevance? 15:42:30 <kmlussier> A few have already been mentioned: field weighting, popularity. 15:42:31 <terran> field weighting: documentation on what the options mean along with a working example or two with detailed explanation would be most helpful 15:42:33 <kmlussier> Are there other factors? 15:43:23 <kmlussier> Or other specific areas where you see relevance falling short? 15:43:23 <terran> relevance: I think it might be useful to look at titles with the most copies being considered more relevant 15:43:44 <sarahchilds> terran: yes! 15:44:16 <kmlussier> terran: Ok, is that because you see that as one way libraries define that one record may be more important than others? 15:45:00 <sarahchilds> lots of copies would be one metric of popularity 15:45:19 <terran> yes - at least for large public libraries and fiction, the more popular an item is, the more copies there will likely be 15:46:06 <kmlussier> Thanks! 15:46:10 <terran> Titles with a large number of holds would also indicate popularity 15:46:16 <kmlussier> Any other factors you think are important for relevance. 15:46:50 <sarahchilds> Also in a consortium, even if everyone has only one copy, if most libraries own it, that indicates popularity as well 15:46:57 <Bmagic> I just caught up with the convo, forgive me, a call came in during this meeting. I have had problems getting the search results I want. Relevance is one reason, and title searches being so limiting is another reason 15:47:05 <terran> sarachilds: true! 15:47:10 <kmlussier> To give a little background on the activity metric, titles with a large number of holds and circs are all types of activity that scores a badge for a record. But I don't want to go to deep into the details here. 15:47:34 <kmlussier> It doesn't cover total # of copies, but you typically find that you get higher circs on those records with more copies. 15:47:36 <Bmagic> When I get 0 search results, I would like it to instead go ahead and do another similar search that will return something 15:47:38 <terran> kmlussier: yes, I'm really interested in that project that you're working on 15:47:57 <kmlussier> Bmagic: Can you tell expand upon what you mean when you say title searches are limiting? 15:48:24 <kmlussier> Bmagic: Also, I would be interested in hearing where you would like to see improvements in relevance. 15:49:17 <Bmagic> The title index isn't the same as keyword by design, but (also by design) it's not encompassing what some of us consider to be the title. I don't know, basically, it needs to "learn" from popular searches 15:49:18 <sarahchilds> Bmagic: I wouldn't want it to *do* the search, but I'd want it to suggest a meaningful alternative 15:49:39 <terran> Bmagic: I like that idea, or to at least suggest options for other searches when there are 0 results. 15:50:08 <kmlussier> Bmagic: Ah, that's interesting. So you're looking for something that can learn from user behavior? 15:50:28 <JBoyer> I realize this would require a ton of work for little visual improvement, but I'll mention it just so it's known: a re-architecting that removes the need for a core limit of records returned would be helpful. (i.e. we have a lib that can't be searched for a common keyword unless the core_limit is close to 80K...) 15:51:15 <JBoyer> "A ton of work" may be an understatement, also. :) 15:51:21 <Bmagic> I think that might be a possible solution. Search phrasing is remembered, and called back on future searches to help the next individual 15:51:29 <sarahchilds> JBoyer: Yes!!! Not returning any results when there are results to be returned is so not cool 15:51:37 <kmlussier> JBoyer: Thanks for that! But I don't want the "ton of work" ideas to go unmentioned, because maybe a ton of work is ultimately needed. Or maybe not. 15:52:06 <terran> JBoyer: I agree. 15:52:29 <kmlussier> OK, we're reaching the eight-minute mark, so I want to move on to my final question. 15:52:43 <kmlussier> #topic Top-priority improvements 15:53:01 <kmlussier> If you could only make two improvements to Evergreen search, what would they be? 15:53:10 <afterl> It's been said before, but SPEED 15:53:13 <kmlussier> Basically, what do you consider to be the most important issues? 15:53:13 <terran> 1) Speed 15:53:28 <terran> 2) did you mean? 15:53:36 <sarahchilds> Hard to argue with speed 15:53:57 <kmlussier> This is why we have 2 instead of 1. The 1st time around it was all speed. 15:54:12 <Bmagic> Speed, 0 or few results automatically expanded to popular related searches 15:54:13 <kmlussier> JBoyer++ # Suggesting that we let people pick two. 15:54:23 <afterl> And I really want to see that popularity/relevance blended search results 15:55:42 <kmlussier> sarahchilds: Do you have a #2? 15:55:44 <terran> Going back to earlier suggestions (since I was late) - I would also like a facet on the search results for audience type (is this possible in the current system already? I haven't delved into how facets work yet) 15:55:59 <sarahchilds> It's too hard to choose! I love them all :-) 15:56:02 <kmlussier> terran: If it is possible, I don't know how to do it. 15:56:34 <kmlussier> terran: Thank you for adding that suggestion! 15:56:55 <terran> Ooh, also a facet for fiction vs nonfiction 15:57:15 <kmlussier> Ok, I might wrap up a little early. But is there anything else you had hoped we would cover today that we didn't? 15:57:29 <kmlussier> terran: Yes, we also had one from a previous group on publicate date ranges. 15:57:35 <kmlussier> publication, even 15:58:34 <terran> One for language might be helpful for a lot of libraries, too 15:58:45 <kmlussier> OK, I'm going to wrap up the meeting, then, but let me know if anything comes to mind later on. 15:58:51 <kmlussier> Thanks everyone for sharing your ideas today! 15:58:55 <kmlussier> #endmeeting