Home>Knowledge Base>Web Hosting>Statistics>Understanding Data Mining
Information
Article ID55
Created On10/15/2008
Modified10/15/2008
Understanding Data Mining

Data Mining is different than other reports. Whereas Custom and Standard Reports provide a general overview of the various statistics available from your log files, Data Mining takes this general information a step further and provides you with fully-detailed reports that allow you to drill-down and analyze your statistical data. These reports are priceless for tracking customers through your ordering process, for seeing which marketing campaigns are providing the highest return on your investment, or for a variety of other tasks requiring a large-scale, fully detailed analysis of your website traffic.

You can begin a data mining session by clicking on the sidebar Data Mining link.

What is the daily activity that this file gets?

This query allows you to select a single page or resource contained within your website and see the daily activity over time. This type of mining is good to identify trends in the activity of that resource, to see if people are accessing the file more or less than they used to.

What are the referrers of this file?

This query is usful for marketing, or to detect image leaching.

For marketing, if you have an image that you have exposed on other web sites that resides on your web server, the URL it was viewed on is recorded every time a browser views your image. This report will list the top URLs or "Refferals" in which your image was viewed.

If you have a nice graphic, chances are that someone else will want to use it on their web site. If they link your image path to their web site, this report will list the top URLs that the image or graphic is viewed from.

What sites were people on before they hit this file?

This query reports the original source of visitors that came into your site that eventually hit the page or file you choose. This can be most useful for marketing purposes. For example, if you have a "Thank You" page after a person orders your product, you can identify the original sources of visitors that reached that page, effectively discovering where your orders came from.

Which visitors used the most bandwidth by downloading this file?

This query allows you to track the bandwidth usage, total bandwidth usage, and the IP address of requests for specific files you have placed on your website. This report is most useful in detecting web site abuse. Typically, you will identify a high-bandwidth file from Top Files or a related report item, then start data mining on it to discover if there are a few IP addresses that are attempting to use your bandwidth by repeatedly downloading files. Those IP addresses can then be metered or denied access on your web server.

Where did people who hit this file enter the site?

This query allows you to select a single page contained within your website and see the top pages used to access the page. This is a particularly useful form of data mining that allows you to track where people are coming from when they access a specific page. For example, if a page is specified in an advertising campaign or marketing campaign, users can see exactly how people are accessing that page, and verify any information that is given by the particular site that contains the advertising.

Where did people exit the site after hitting this file?

This query allows you to select a single page contained within your website and see the last pages people viewed before exiting the website. This is useful in order to track how far users get in a process (such as placing an order), and where they are exiting your site.

How many times did people return to my site before hitting this file?

This query allows you to select a single page contained within your website and see the number of times a person visited the site prior to reaching and viewing the page selected. The report itself shows the IP address of the visitor, the initial Entry Page for that visitor before they viewed the page selected, and the number of visits before the page selected was viewed.

What query strings were passed to this file?

This query allows you to view the various query strings that are passed to a particular page of your website.

This query differs from the Query Items query in that it does not separate query string items. For example, assuming you have a hit that looks like /help/default.aspx?section=Administrator&page=Getting+Started. The Query Items report will show two lines: One for section=Administrator and one for page=Getting+Started. Conversely, the Query Strings report will show one line that looks like section=Administrator&page=Getting+Started.

What query items were passed to this file?

This query allows you to view the various query string items that are passed to a particular page of your website.

This query differs from the Query Strings query in that it separates each query string item and reports them individually. For example, assuming you have a hit that looks like /help/default.aspx?section=Administrator&page=Getting+Started. The Query Items report will show two lines: One for section=Administrator and one for page=Getting+Started. Conversely, the Query Strings report will show one line that looks like section=Administrator&page=Getting+Started.