log analysis

jtrant's picture

search term analysis: percentages of term use in a hitwise report

The question of what makes a 'meaningful' number of queries of an on-line resource has been lingering in the back of my mind ever since I analysed the Guggenheim Museum search logs last fall. At issue, really, is how to profile and analyse the long tail of user searching. (A D-Lib article not long ago discussed ways to analyse the nature of the tail.)
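To make "profiling the tail" a little more concrete, here is a minimal sketch of one way to do it, not the method used in the Guggenheim study. It assumes the log has already been reduced to a plain list of query strings. The function name `profile_queries`, the `head_size` parameter, and the sample queries are all invented for illustration.

```python
from collections import Counter


def profile_queries(queries, head_size=10):
    """Summarise how search volume splits between popular and rare terms."""
    # Light normalisation (an assumption): trim whitespace and ignore case,
    # so "Picasso" and "picasso " count as the same term.
    counts = Counter(q.strip().lower() for q in queries if q.strip())
    total = sum(counts.values())
    if total == 0:
        return {"total": 0, "distinct": 0, "head_share": 0.0,
                "singleton_share": 0.0, "top": []}

    ranked = counts.most_common()   # most frequent terms first
    head = ranked[:head_size]       # the "head" of the distribution
    singletons = sum(1 for _, n in ranked if n == 1)

    return {
        "total": total,                      # all queries
        "distinct": len(ranked),             # unique terms
        # percentage of all queries accounted for by the head terms
        "head_share": 100.0 * sum(n for _, n in head) / total,
        # percentage of all queries whose term was searched exactly once
        "singleton_share": 100.0 * singletons / total,
        # each head term with its percentage of total queries
        "top": [(term, 100.0 * n / total) for term, n in head],
    }


# Invented sample data, for illustration only.
sample = ["picasso", "Picasso", "kandinsky", "picasso", "blue", "mapplethorpe"]
profile = profile_queries(sample, head_size=2)
```

The percentage of queries that fall outside the head (100 minus `head_share`) and the share searched only once are two rough indicators of how long the tail is; where to draw the line between head and tail remains a judgment call.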

What brought this back to mind was a Hitwise Newsletter report that included the following analysis of terms that contain 'summer'.

Search Terms Analysis: Search Analysis- "summer" Search Term Analysis
Most popular keywords containing the term "summer" for the 4 weeks ending 05/19/07

searching museum collections on-line – what do people really do?

[thumbnail of search term frequency graph]

I've recently taken a look at a year's worth of search log data from the Guggenheim Collection on-line -- a pilot study for some work within the steve.museum project. I've attached a draft paper to this post -- comments are welcome! It's still rough in spots, but I need to step back.

One of our premises in discussing folksonomy in the museum is that allowing users to tag collections will improve their retrievability... but surprisingly, we know almost nothing about what searchers of museum collections really do. I couldn't find a single serious information retrieval (IR) study in the museum domain. There's lots of literature about what we 'should' do, how standards will help and why controlled vocabulary is really important, with almost no evidence to support those claims. We need to look hard at the data.

Notable findings in the Guggenheim data:
