For as long as I have been a web developer with CETIS we have relied on analysing server logs to give an indication of traffic sources and visitor trends. This approach existed long before I joined CETIS and seemed like a logical way of doing things, CETIS has had many web servers and many different developers have installed different tools and resources and since they were all using the same servers and producing the same style logs it has been a reasonable method of producing comparable stats.
While this method of collecting stats has stayed the same over the life of CETIS, the direction of CETIS and the environment that it finds itself in has changed over time and a need for a new strategy has become apparent.
Challenges from JISC CETIS and the environment
- JISC CETIS is more distributed from a technical point of view
Historically CETIS has had access to physical servers that sat in a server room somewhere in a University. A recession later and shifts in University policies mean that the abundance of resource is no longer available. While there are lots of external providers are happy to help you produce a flexible service and tie you into their hosting packages it does raise issues. Do we have access to server logs? Are the logs the same? If not then are the stats produced similar to the stats package we use? Can we even produce stats?
Similarly JISC CETIS is moving away from bespoke code when there are popular services that do the same thing and this raises similar questions. What stats do the services produce, are they comparable with other services, is there an API and will we have to pay to access what we’ve collected down the line.
- JISC CETIS is more distributed from a people point of view
Staff in JISC CETIS are technologically savvy and have our opinions on the services and techniques that we like. While I think it is a good thing to have such a technically diverse organisation trying new and exciting things it is also a problem from a stats analysis point of view. Are staff hosting their blogs, events and resources on cloud services and if so how do we measure the use of these resources?
- A call for more sophisticated analytics
- Google Analytics is more intelligent when it comes to what is and isn’t a visitor or a bot
- The hacks for Google Analytics to track binary files and RSS are not very good.
A hybrid solution
Finally I think that as organisations become more distributed and stats become more personal a web analytics strategy becomes more of an individual responsibility. I’m not quite sure what an effective strategy where analysis of individuals resources trends is helped to steer the organization as a whole would look like.
More to come…