Data filtering at Spotibo

Filters are handy, especially if you need to dig deeper into analysis without using spreadsheets. You can include or exclude pages and combine more filters and segment data according to your needs.

You can analyze pages by these filters:

Content
 Name Filter description All filter options
URL Filtering by URLs
  • Text of the URL address
  • Length of the URL
Title FIltering by title tags
  • Title text
  • Length in pixels or number of characters
  • Count of title tags
Meta Description Filtering by meta description tags
  • Meta description text
  • Length in pixels or number of characters
  • Count of meta description tags
Headings Filtering by headings, by all or only by H1
  • Heading texts
  • Count of headings
Links
 Name Filter description All filter options
Level Filtering by level of the URL in website structure
  • URL level (Level 0 is a homepage. Any higher number means lower level of a page in the structure.)
Incoming Link Texts or ALT Filtering by the number of all links incoming to the URL
  • Count of the incoming links
Incoming Links Count Filtering by meta description tags
  • Meta description text
  • Length in pixels or number of characters
  • Count of meta description tags
Incoming Crawlable Link Filtering according to whether a linking page to the URL is crawlable
  • Crawlability of a linking page
Link Count on URL Filtering by a number of all internal and external links found on the URL
  • Count of all internal and external links on the page
  • Count of dofollow or nofollow links

Redirects

Name Filter description  All filter options
Finally Redirected to Filtering by the final URLs of redirected pages
  • Text of the final URL address
Final Status Code Filtering by the final HTML status codes
  • Status code of the final URL – 200, 3xx, 4xx, 5xx
Redirect Chain Filtering by the first URL in the redirect chain between the first redirected URL and the final URL
  • Text of URL in the redirect chain
  • Status code of the URL in the redirect chain
Count of Redirects Filtering by number of redirects from the first to the final URL
  • Count of all redirects
Redirect Loop Filtering by possible redirect loops
  • True or false redirect loop

Technical

Name Filter description

All filter options

Search Engine Access Filtering by the access of any or a specific bot, e.g. Google, Bing, Yahoo, ASK, Yandex, Baidu or Seznam
  • Access directives (noindex, nofollow, nocrawl or others)
Canonical Filtering by a reference to a canonical URL or defined self-redirecting
  • Defined canonical in HTTP header or HTML
  • Text of a canonical URL address
  • Text of self-redirecting
File Size Filtering by file size in KB
  • Size of an image
Status code Filtering by HTTP status codes
  • Status code of an URL – 200, 3xx, 4xx, 5xx.
Content type Filtering by the type of content on the URL (e.g. text, image, application, pdf)
  • Image, text, application…
Charset Filtering by text coding (e.g UTF-8, UTF-16)
  • Text of charset type
Time of Last Crawl Filtering by the time of the last URL crawl with Spotibo
  • Exact time

All of the filters can be used differently according to their value in text and number. In the table below are a few possible examples and how to work with them.

Filter by

Example

In Spotibo

Text

Containing I need to search to see if all pages on a site have their brand in the title. Exclude Title Containing “name of my brand”
Exactly Matching I have to find pages with specific linking anchor text. Include Incoming Link or ALT text Exactly Matching “anchor text”
Matching RegExp (Java Regex) I need to select guide pages with titles containing “how to.” Include Title Matching RegExp “how to.*+guide”
Begins/Ends with I want to find out if a crawler found any pages on HTTP protocol on my secured site. Include URL Begins with “http://”

Number

(Element count and length in characters or pixels, link count…)

Equals I need to select only pages redirected by a 302 status code. Include Status Code Equals “302”
Less Than I want to identify pages with one or no incoming links. Include Incoming Links Count Less than “2”
Greater Than I need to find pages that are deep in the site structure. Include Lever Greater than “5”

Find out more about how you can use Spotibo tool:

Do you miss any cool functionality in Spotibo? Let us know at janko@spotibo.com.