TL;DR
Googlebot now fetches only the first 2MB of HTML pages (excluding PDFs), meaning critical content below this threshold could be missed if your page is too large.
Key points
- 1
The 2MB HTML limit: Googlebot fetches up to 2MB of HTML per URL (excluding PDFs), which includes HTTP headers but stops at this cutoff. This means if your page exceeds 2MB, Googlebot ignores everything beyond that point—critical content like structured data or meta tags could be missed. For example, a page with inline base64 images or massive CSS/JavaScript blocks might get truncated, making your site less visible in search results. To avoid this, keep your HTML lean by moving heavy assets to external files and prioritizing critical elements like meta tags near the top of your page.
- 2
How resources are handled: Resources like images, videos, and fonts are fetched separately by Googlebot with their own byte limits, not counted toward the parent page's 2MB cap. This means a page with large media files won't be truncated by the HTML limit itself. However, if your page includes bloated inline assets, the 2MB cutoff could still cut off essential content. For instance, a page starting with a 1MB menu block might lose its title tag and structured data below the cutoff. Always check server logs for slow responses, as Googlebot may back off if your server struggles to serve bytes quickly.
- 3
The WRS rendering process: After fetching the HTML, Googlebot passes it to the Web Rendering Service (WRS), which executes JavaScript and CSS like a browser but operates statelessly—clearing local storage between requests. This means dynamic elements relying on user sessions or local storage won't work as expected. For example, a login-dependent page might render differently for Googlebot than for real users. To mitigate this, ensure critical functionality loads early in the HTML, and avoid heavy client-side logic that depends on stateful interactions.
- 4
Practical fixes for your site: To stay under the 2MB limit, move heavy CSS and JavaScript to external files, place critical elements like meta tags and structured data near the top of your HTML, and monitor server response times. If your page is slow to serve bytes, Googlebot may reduce crawl frequency, so optimize server performance. For example, a site with a 1.5MB HTML file and 500KB of inline CSS would be truncated at 2MB, but external CSS would be fetched separately and not count toward the limit. Regularly test your page's size using tools like Google Search Console to ensure key content isn't lost.
Share this update
This is a summary of an official post from the Google Search Central Blog, provided for quick reading. Google and the Google logo are trademarks of Google LLC; My Tool Studio is not affiliated with Google. Always refer to the original announcement for authoritative guidance.