How do I know how many search engines have scratched my blog?

Among the statistical tools of wordpress StatTraq  There is an Iranian plugin. Among the features of this plugin is the Search Engine Saturation menu, where you can see how much the search engine has scraped your blog posts.



The principle behind this is simple.
Whenever a browser or search engine requests a page from the web server, it stores who it is in a field called user_agent and sends it. Therefore, if statistics are given by user_agent out of the total articles, you can see how many articles have been crawled by user_agent.

You just need to know the name of user_agent. Google is Googlebot, Yahoo! is Inktomi/Yahoo, and MSN is msnbot. Naver is Bot[NaverBot-1.0 (NHN Corp. / +82-31-784-1989 /

Comparison of crawl status of Google, Yahoo!, and Naver after blog openingI tried, but if I open a blog and leave it alone, Naver does not seem to be scratched.

It has been a long time since I created the blog, but there were no visitors from Naver, so I tried to modify the source of StatTraq by knowing the name of Naver user_agent to find out how much Naver scraped my blog posts.

StatTraq basically shows the values ​​of google, yahoo, and msn search engines, but if you simply modify the source, you can see Naver as shown in the picture.



Naver's search engine doesn't seem very diligent. Or maybe it wasn't worth scraping my blog either.

In case the StatTraq source is upgraded later, the modified place is as follows.

You can do this by editing the search_engine_stats.php file. This file is located in the wp-stattraq/reporter directory where you installed StatTraq.

1. At line 65, add the following code to specify the user_agent name of Naver to be searched.

$naver = getPageDBResults(“Bot[NaverBot-1.0 (NHN Corp. / +82-31-784-1989 /”, $date_format, $time_frame, $betweenClause, $orderBy);

2. Add the following source code to initialize the neighbor number and count the number.

$naver_count = $naver->cnt;

3. Modify the following to display the results


echo ' Number of Pages Indexed ' . $google_count . ' ' . $yahoo_count . ' ' . $msn_count . ' ' . $naver_count . ' ';
echo ' Number of Pages Not Indexed ' . ($total – $google_count) . ' ' . ($total – $yahoo_count) . ' ' . ($total – $msn_count) . ' ' . ($total – $naver_count) . ' ';
echo ' Percent Saturation ' . floor(($google_count/$total)*100) . '% ' . floor(($yahoo_count/$total)*100) . '% ' . floor(($msn_count/$total)*100) . '% ' . floor(($naver_count/$total)*100) . '% ';



Rather than writing the modified parts part by part and following along to fix it, if you just want to get a whole file and edit it without thinking, you can overwrite the source that I have modified as follows.

1. Back up search_engine_stats.php in the directory (wp-stattraq/reporter) of the web server where StatTraq is installed. 2K download and overwrite the original file.

To see how many search engines have scratched your blog, go to the menu SE Click on Saturation to see it.



0 If you like the article, please click the heart~ It will be a strength to bloggers (SNS/login/advertising is not related)

Articles you might like

Health to Calendar

Import your Apple health records into your calendar events and view your health records chronologically on your calendar


Add a Comment

Email addresses are not disclosed. Required items *is indicated by

This posting is part of Coupang Partners' activities, and a certain amount of commission is provided accordingly.