Why Most CA Websites Are Invisible on Google

Try this. Open a new browser tab, search for the name of a CA firm you know — not "tax consultant in Bangalore", their actual firm name. Three times out of four, the result is some combination of: ICAI's member directory, a dead Wix page from 2018, IndiaMART listing, and possibly a JustDial entry. The firm's actual website? Page two. Sometimes nowhere at all.

This isn't because the website doesn't exist. It's because the website does four specific things wrong, all of which are fixable in an afternoon and three of which we now do correctly by default in AI4CA templates. We're going to walk through them in order of impact.

#1: There's no canonical URL, or there are two competing ones

This is the single biggest issue we see, and the reason ai4ca.in itself wasn't ranking until last week — the same bug bites everyone.

If your firm's website is reachable at both yourfirm.in and www.yourfirm.in and both serve identical content, Google sees them as two pages competing for the same query. Without a clear signal which is canonical, Google may split ranking between them, or pick the wrong one, or de-prioritize both.

The fix is one of two things, both supported by every web host:

  • A 301 redirect from one to the other (we redirect www to non-www by default in our templates)
  • A <link rel="canonical"> tag on every page pointing to the chosen version

You want both, ideally. The 301 is the strong signal; the canonical tag is the belt-and-braces backup.

How to check: open your firm's site with www. and without www.. If both load, and the URL bar doesn't redirect, you have this bug. About 60% of CA sites we've audited have it.

#2: The robots.txt file blocks the very sitemap it points to

This sounds absurd. It is absurd. We had it ourselves until last week.

A typical robots.txt has rules like Disallow: /admin/ (correct, blocks admin pages) and at the bottom, Sitemap: https://yourfirm.in/sitemap.xml. So far fine. The bug is when somewhere in between, there's a blanket Disallow: /*.xml$ — meant to block scrapers from grabbing exports — and it inadvertently blocks Google from reading the sitemap you just told it to read.

Google's behavior: it fetches robots.txt, sees the sitemap URL, tries to fetch it, gets blocked by the same robots.txt, gives up. None of your URLs get discovered through the sitemap. Google has to find them via links from other sites, which for most CA firms means it never finds them.

Check yours: visit yourfirm.in/robots.txt and look for any Disallow: /*.xml or Disallow: /sitemap. If present, remove. We removed it from our default templates last week.

#3: The sitemap exists but lists nothing useful

Even when the sitemap is reachable, it often lists only the home page. That's a sitemap of one URL. Google can technically work with that, but it has nothing else to crawl. Your "About", "Services", "Team", "Contact" pages — all need to be in the sitemap with realistic <lastmod> and <priority> values.

The pattern we ship in AI4CA templates:

<url>
  <loc>https://yourfirm.in/</loc>
  <priority>1.0</priority>
  <changefreq>weekly</changefreq>
</url>
<url>
  <loc>https://yourfirm.in/services</loc>
  <priority>0.9</priority>
  <changefreq>monthly</changefreq>
</url>
<url>
  <loc>https://yourfirm.in/about</loc>
  <priority>0.7</priority>
  <changefreq>monthly</changefreq>
</url>
... etc

The <priority> is a relative hint to Google about which pages matter most on your site. It doesn't affect ranking against other sites — only which of your pages get crawled first. Home and Services should be 0.9-1.0; Privacy and Terms should be 0.3.

<changefreq> is a hint about how often you update each page. Weekly for blog/home, monthly for static pages, yearly for legal. Be honest — Google notices if you claim "weekly" but the page hasn't changed in a year.

#4: There's no structured data telling Google what kind of business this is

This is the difference between Google understanding "this is a website with words" and "this is a CA firm in Bangalore named ABC & Associates that offers GST advisory and audit, run by partners X, Y, Z, with FRN 012345N."

The first lets Google index you. The second lets Google show you in rich results: the knowledge panel on the right when someone searches your name, the local pack on Maps, the "people also ask" sections.

The mechanism is JSON-LD — a script tag with structured data in your page's <head>. The relevant schema for CA firms is AccountingService (a subtype of LocalBusiness):

{
  "@context": "https://schema.org",
  "@type": "AccountingService",
  "name": "ABC & Associates",
  "telephone": "+91-80-12345678",
  "email": "contact@abcassociates.in",
  "address": {
    "@type": "PostalAddress",
    "streetAddress": "...",
    "addressLocality": "Bengaluru",
    "addressRegion": "KA",
    "postalCode": "560001",
    "addressCountry": "IN"
  },
  "areaServed": "India",
  "priceRange": "₹₹"
}

Most CA websites have none of this. Our templates ship with it filled in from the customizer's firm details — you fill in your firm name and address once, the JSON-LD generates correctly on every page.

What this gets you, in practice

After fixing the four issues above, the typical pattern we see for a CA firm website (ours and customer sites):

  • Week 1-2: Google re-crawls robots.txt and the sitemap. Indexed page count goes from "1 or 2" to "8-15" depending on how many pages the site has.
  • Week 2-4: Searches for your exact firm name start returning your website as result #1 (it was previously result #5-10 or absent).
  • Month 2-3: Long-tail queries like "GST consultant near Indiranagar" or "ITR filing CA Bangalore" start showing up in Search Console as impressions, sometimes converting to clicks.
  • Month 3+: If you've also been writing blog posts (cf. the post on auto-replies and broadcasts), those start ranking for the specific topics they cover.

The first two are essentially automatic. The third and fourth depend on having content; this is where our templates can only do so much — they give you the structural fix, you have to write the words.

What we do automatically in our templates vs what's still on you

We do, on every site deployed via AI4CA:

  • 301 redirect from www to non-www (or vice versa, your choice)
  • Correct robots.txt with sitemap reference and no self-blocking rules
  • Sitemap covering Home, About, Services, Contact, Privacy, Terms (and any blog if enabled)
  • JSON-LD AccountingService schema with your firm details
  • Correct canonical URLs on every page
  • Open Graph + Twitter Card tags for social sharing previews
  • HTTPS by default (Cloudflare's free SSL)

What's still on you:

  • Submitting the sitemap to Google Search Console (we can't do this for you — it requires your Google account auth)
  • Setting up Google My Business (similarly, your auth, your business)
  • Writing useful content — service pages with actual detail, blog posts, partner bios. This is where most CA sites fail. "We provide quality services" doesn't rank for anything.
  • Getting backlinks — being listed in CA association directories, local business directories, partner firm websites. Outside our scope but high-impact.

The four structural fixes above are the cost of admission. The next layer — content and backlinks — is where firms differentiate. But you can't differentiate on content if Google can't even find your sitemap, which is the bug most CA sites are stuck on right now.

How to check your existing site

If you have a website built somewhere else and you want to know which of these issues you have, three free tools answer the questions in five minutes:

  1. https://yourfirm.in/robots.txt — does it exist? does it have any Disallow: /*.xml?
  2. https://yourfirm.in/sitemap.xml — does it exist? does it list more than just the home page?
  3. Google Search Console "URL Inspection" tool — paste your home page URL. Does it say "Indexed" or "URL is not on Google"?

If you fail any of these, the issue is structural. If you pass all three but still don't rank for your firm name, the issue is content/backlinks — different problem, same first step (build a real website with real pages).


AI4CA templates ship with the four structural fixes above on by default. Pick a template, fill in your firm details, publish — the SEO scaffolding is correct from day one. Free tier on our subdomain (firmname.ai4ca.in), ₹399/month for your own custom domain. About 60% of customers report being indexed for their firm name within two weeks of going live.

Build your CA practice website in 10 minutes

Pick a template, customize, and publish under your own domain. No code, no monthly hostage fees.

Get started free