What Are Orphan Pages & How Do I Find Them?

broken link

Do you have pages with the potential for ranking and organic search traffic but aren’t part of your site structure? Or pages that aren’t supposed to be in your site structure, but Google finds them anyway?

The answer is most likely yes. It certainly is the case for the majority of websites!

These are known as orphan pages, and re-associating the good ones with your website structure helps you fully utilize their potential (as does blocking search engine bots access from your low-value ones!).

What are orphan pages?

Orphan pages are those on your website that are not linked to from anywhere else on your site. This means that a userr isn’t sure where to go from there for more info because your page isn’t interlinked to other parts of your website. These pages aren’t indexed, so search engine crawlers can’t find them.

They represent lost opportunities to attract and engage customers, which might negatively impact your bounce rate. We do not encourage losing retention, page traffic, and revenue, as well as jeopardizing your SEO success due to orphan pages. Crawlers need to be able to find your pages if they are linked to other pages.

Consider it like a spider web that a spider can crawl on. The spider will have difficulty moving from one location to another if sections of it are broken. When this happens with clients, they will not stay on your orphan page; instead, they will depart entirely. Avoid Unlinked pages at all costs.

Orphan vs. Dead-end pages

Before we go into orphan pages, let’s clarify the differences between two SEO terms that can be misunderstood.

As previously stated, an orphan page is not linked to or reachable from any other page on the same website.

On the flip side, a dead-end page is a web page that does not link to any other internal or external web pages, resulting in a “dead end.”

When visitors arrive at this page, they have the option of pressing the back button or leaving the site.

When search engine crawlers arrive on the dead-end page, they reach a dead-end and no link equity can be passed.

A dead-end can be readily solved by adding connections to your on-page content or ensuring that sidebar or footer navigation is present on every page.

How do I locate orphan pages on a website and how do I fix them?

Orphan pages are classified into two types:

  1. The expected orphan pages, which you should not be concerned about.
  2. The pages that should not be orphan pages, which you should be aware of.

Their type will determine the path you follow to repair your orphan pages. So, when we notice a significant volume of orphan pages, the first thing we do is look at what they look like and whether they are expected or not.

Expected orphan pages: not usually a reason for worry

 After doing a site crawl and comparing it to your server log files, to find pages Google can find but aren’t in your site structure, then you can click on “found by Google” to receive a list of all your orphan pages.

Many of these orphan pages will be generated by:

1. Pages that do not already exist on your site but are linked to by another site. It is usual to receive an external link to a page, which you then remove or redirect. Google will still detect the old link because it still exists on the other website.

How to fix: since  you have no control over the links on other websites, the only option to resolve this type of orphan page is to contact the site owner and request that they update the page to the correct new location.

2. Pages that return non- 200 status codes. Google may continue to crawl pages that produce 4xx status codes even after being updated on your site.

How to fix: Google will eventually stop crawling these pages. There is nothing to be concerned about.

3.   Expired pages: This is prevalent on websites with a large number of short-lived pages, such as classified ads that expire quickly.

How to fix: We should only be concerned about expired pages discovered by Google if they have been orphaned for an extended period. Otherwise, the number of orphan pages indicates the website’s page rotation rate and should be regarded as food for thought.

Unexpected orphan pages: cause for concern?

1. Expired pages still returning content Some websites stop linking to expired material (such as products withdrawn from the catalog) and fail to return a status code (such as HTTP 404 or 410) indicating that the content is no longer available. As a result, the previous page is still accessible.

How to fix: In addition to eliminating links to expired material, ensure that the expired page is updated with the correct status code. Make sure to 404 or 410 the content if it is no longer available.

2. Pages left out of a prior site migration: These are pages not redirected; thus, old content may still be visible.

How to Repair: If your new website contains similar material, you should redirect these old URLs to it. If there isn’t, these outdated/omitted pages should produce a 404 or 410 status code.

3. A syntax error while building sitemaps: This results in incorrect URLs that can still return content and  create duplicates or return HTTP errors.

How to fix: If you discover incorrect URLs caused by a syntax problem, work with your development team to find a solution.

4. A syntax error occurred while creating canonical tags: These result in erroneous URLs. These URLs could be serving status codes 200 OK or error codes.

How to fix: If you discover incorrect URLs caused by a syntax problem, work with your development team to find a solution.

5. Important, high-quality pages that aren’t linked in your website structure. Some websites use navigation pages ( essentially content lists like category pages or internal search result pages) that are only linked if one or more criteria are met.

How to fix: The correct technique is to decide when a page no longer meets your business criteria to be a target for organic traffic and then remove it once and for all: remove links and return HTTP 404 or 410. Until that time, it should be linked to someplace on the website.

1) Get a complete list of all of your current website pages.

Using your favorite website audit tool solution and expecting it to find orphan site pages will not work because orphan page, by definition, are not connected to any domain page. They will never be discovered by the crawler. Instead, you must specify the entire list of site URLs to be examined by the crawler. There are several methods for obtaining the URL list:

 Use your sitemap file

The sitemap is a file that is normally placed at the root of your domain to assist search engine bots in understanding the content of your site;  like how frequently it is updated, and how to display your material on search engine results pages effectively, or SERPs.

Obtain a list of site URLs

If a sitemap isn’t an option – for example, if the sitemap doesn’t include the entire page list – you can construct the list from your CMS. Installing a lightweight plugin, such as List URLs, on WordPress, for example, allows you to export a list of site URLs as a CSV file

Or using our link audit tool, you can go to the main Summary section and quickly identify any orphan pages on your site.

Google Analytics

If your web pages have Google Analytics installed, your Google Analytics data may be the ideal location to start looking for orphan pages on your website.

Start by compiling a detailed list of URLs on your site, which can be found in the left sidebar of your Google Analytics account. Click on “Behavior,” select “Site Content,” and finally click on “All Pages.”

Because orphan pages are notoriously difficult to locate, it’s safe to assume that the number of times your target audience has visited them is comparatively low.

So, in your Google Analytics tool, click on “Pageviews” so the arrow points upward, and the tool will provide a list of your site URLs from least viewed to most viewed, moving OPs to the top of the list.

 To make your list really comprehensive, set the date range – situated in the top right of Google Analytics – and set the date to when you started building your website to ensure that your list is as thorough as possible. Then, press the “Apply” button:

Next, expand the URL list by clicking on “Show rows” in the bottom right and selecting 5 000 from the dropdown menu:

Of course, if you have more than 5,000 web pages, you will have to export per batch until you get all Google Analytics visits data for your complete website.

However, because you will be searching for orphan web pages from least to most visited, your list will most probably include all orphan pages in only a few batches.

Once you’ve loaded all of your URLs, click “EXPORT” in the top right-hand corner of Google Analytics and choose “Google Sheets” or any other file format you’re comfortable exporting your list with.

After you’ve imported your URL data, it’s time to search for orphaned website pages.

You can discover and address any orphan pages on your site using a simple 5-step process:

  • Get a full list of your current website pages.
  • Run a website crawl for pages with zero internal inbound links
  • Analyze the audit results
  • Resolve any orphan page found
  • Rerun the audit periodically to catch new unlinked pages

Let’s take a quick look at each of these processes.

2) Run a website crawl for sites that have no internal inbound links.

Set up the audit rule to catch pages that lack at least an inbound internal link to identify orphan pages. Set up a recurring crawl while configuring the audit to catch new unlinked pages in the future. It’s worth noting that if you’re dependent on a URL list, you’ll need to acquire an updated list from your CMS.

3) Analyze the audit results.

Analyze visits, traffic sources, and page views, as well as entry and exit behaviors using your web analytics solution. In the sample below, we have a sample of a campaign page assisting in traffic acquisition for a set period. When the campaign is over, the page no longer draws traffic and can be taken down

4) Resolve any orphan pages discovered.

Once you understand the orphan page’s function and how it contributes to your website’s  marketing goals, you may decide what action, if any, to take with the page:

  • Link to it from other internal pages if site visitors must find it via browsing
  • Archive it if it’s no longer needed
  • Leave it as-is if it’s serving a business need that doesn’t require internal linking to the page.

There are two frequent reasons for orphan pages that should be addressed and resolved as soon as possible.

Both of these causes are essentially page duplicates that should always redirect to the same URL.

If they don’t, certain versions of the page are likely to be orphaned because they aren’t linked.

The fact that they are orphans isn’t the main issue in this situation; it’s the fact that they are duplicates.

These may come up later while searching for orphan pages and must be dealt with, so it’s a smart option to get them out of the way before they do.

Non-Canonical www/non-www or https/http 

Every public page on your site should generally use http or https (preferrably https) consistently, as well as www or non-www consistently.

To see if this is the case, enter all of the following variations of your site’s homepage into your browser:

  • https://www.example.com
  • http://www.example.com
  • https://example.com
  • http://example.com

All four versions should immediately lead to the same URL.

That page should be canonical to itself for consistency.

If one of these versions fails to redirect successfully, it may hint at larger issues on the site.

Examine other URLs with that variation to see if it’s a more prevalent problem.

You should test a couple more pages on your site and verify your site’s.htaccess file to ensure that redirections for these are properly configured.

Here’s  how to force https in . htaccess. If you do this, ensure that every page on your site supports SSL, or your users may see a scary browser warning.

Here’s how to force  www or non-www. Check once more that this will not cause any server issues.

Trailing Slashes

Another issue to keep an eye out for is the employment of trailing slashes regularly.

These two URLs, for example, may deliver the same content, but they are not identical:

  • https://example.com/page1/
  • https://example.com/page1

Examine a few pages on your site, both with and without the trailing slash, to ensure that they automatically redirect to the same URL and do so consistently.

Check that this is set up properly in .htaccess.

Here’s how to force a trailing slash in.htaccess, 

5) Rerun the audit periodically to catch new orphan pages.

SInce pages can become orphaned over time – by adding new content and failing to link to it, or by mistakenly removing links to pages buried deep in the site hierarchy – it is critical to regularly review the site for new issues. As previously stated, you may enable Linkilo to rerun your audit regularly by scheduling a crawl

Why are orphan pages bad for SEO?

Orphan pages generate two major SEO issues:

1. Low Rankings & Traffic: Even if they have amazing content, orphan pages seldom rank well in SERPs or receive a lot of organic search traffic.

2. Crawl Waste: Low-value orphan pages (such as duplicate pages) can divert crawl budget away from vital pages.

When orphan pages account for a sizable portion of the pages Google explores on your website, such as more than 70% in the example below, you get a decent picture of how serious the situation is.

How to avoid having orphan pages

Naturally, you don’t want to remove old postings because you want to keep their SEO trust. At the same time, you want the old content to be client-facing, rather than converting them to Orphan Pages, it is preferable to link to them elsewhere on the website.

What to do with unwanted pages

Putting non-client-facing pages in the footer has proven to be a smart strategy in the past. Contact Us,  Privacy Policy, and other sections are frequently found in the site footer.

Click here to learn more about how many internal links you should have on each page.

This assures that the link appears on every page of the website, but users rarely, if ever, notice the footer.

Furthermore, an archive homepage can be built, displaying all of the undesired portions of the website and links, similar to a regular HTML sitemap page. Orphan Pages are avoided in this manner, while current pages continue to acquire SEO trust.

Important takeaways

Remember that, while orphan pages may not be a major issue,  you should not be careless and leave your web pages alone if you want to deliver a pleasant user experience to your site visitors and enable search engines to index the most significant pages on your website.

So, when managing orphan web pages, keep the following in mind:

  • Eliminating orphan pages can improve your SEO. 
  • Do not confuse orphan pages and dead-end pages.
  • You can use other tools, such as Google Search Console, Raven Tools, SEMrush, Moz,  Link Explorer, and Ahrefs when identifying your orphan.
  • You can use Google Analytics to find orphan web pages.

If you currently have a WordPress site, It is best to use our WprdPress Plugin to fix your orphan pages quickly. Or you can use any SEO Audit tool to help identify orphan pages or consult with an SEO expert for best advice.

© Copyright Linkilo.co 2021. A Product by  SEO RANK SERP LLC | Privacy Policy | Terms and Conditions