You need to sign in to do that
Don't have an account?
sam samm
Help with robot.txt
Google is not showing the website description even though i added the robot text allowing everything.
This is what google is showing:
A description for this result is not available because of this site's robots.txt – learn more.
also this is the robot txt file i am using:
User-agent: *
Disallow:
Any idea ..
This is what google is showing:
A description for this result is not available because of this site's robots.txt – learn more.
also this is the robot txt file i am using:
User-agent: *
Disallow:
Any idea ..
1. Install the force.com ide
2. Create a Visualforce Page that represents the robots.txt you would like
<apex:page contentType="text/plain" showHeader="false">
User-agent: msnbot
Disallow: /
</apex:page>
3. Add the following to your package.html
<types>
<members>*</members>
<name>CustomSite</name>
</types>
4. "Refresh from Server" before you can see the sites.
5. Open up your site under the "sites" folder and add in a "robotsTxtPage" entry and save.
<?xml version="1.0" encoding="UTF-8"?>
<CustomSite xmlns="http://soap.sforce.com/2006/04/metadata">
<active>true</active>
<authorizationRequiredPage>Unauthorized</authorizationRequiredPage>
<bandwidthExceededPage>BandwidthExceeded</bandwidthExceededPage>
<changePasswordPage>ChangePassword</changePasswordPage>
<fileNotFoundPage>FileNotFound</fileNotFoundPage>
<inMaintenancePage>InMaintenance</inMaintenancePage>
<indexPage>IdeasHome</indexPage>
<masterLabel>First</masterLabel>
<portal>Customer Portal</portal>
<requireInsecurePortalAccess>false</requireInsecurePortalAccess>
<robotsTxtPage>RobotsTxt</robotsTxtPage>
<siteAdmin>mylogin@myco.com</siteAdmin>
<subdomain>mydomain</subdomain>
</CustomSite>
6. Now http://mydomain.force.com/robots.txt will contains the description of your page
You can also use your own custom favorite icon. Here are the steps:
1. create 16x16 icon
2. upload as a static resource and make sure cache control attribute is set as public
3. Disable the standard header in your visualforce page and reference the favorite icon static resource in your page as following:
<apex:page showHeader="false" >
...
...
<link REL="SHORTCUT ICON" HREF="{!URLFOR($Resource.favicon)}">
.....
</apex:page>
Please use following in your VF page for allowing all bots to crawl your website.
<apex:page contentType="text/plain" showHeader="false">
User-agent: *
Disallow:
</apex:page>
Please use following link to submit your website to bing:
http://www.bing.com/toolbox/submit-site-url
Following Robots Database lists robot software implementations and operators.
http://www.robotstxt.org/db.html
Changing Bing's Crawl Settings
Microsoft allows you to set the crawl rate for your site and even allows you to specify the crawl rate by hour of the day (a crawl pattern). This allows you to specify that Bing crawl your site more aggressively during non-peak hours and less aggressively while a majority of your visitors are online.
Note: Before specifying a crawl pattern, you may want to view your web statistics in cPanel to see when most of your visitors are online.
To change Bing's crawl settings:
Please refer following links for more information on bing search :
http://www.bing.com/blogs/site_blogs/b/webmaster/archive/2012/05/03/to-crawl-or-not-to-crawl-that-is-bingbot-s-question.aspx
Also you can use following link for generating robot.txt file.
http://www.mcanerin.com/en/search-engine/robots-txt.asp
Also robot.txt file can be checked at :
http://phpweby.com/services/robots
http://www.frobee.com/robots-txt-check
http://webmaster.yandex.com/robots.xml
I hope this helps.
Thanks,
Amit Bhardwaj
Also my website contains some sensitive user data.
Can i gave permission just to one page in the whole org rather than allowing everything.
waiting for the answer eagerly.
I just want crawler to access this page and not the pages after user logged in and there data.