Sitemap Generator |
|
|
|
|
Sep 9 2008, 03:57 PM
|
Group: Verified NS Member
Posts: 117
Joined: 20-August 08
Member No.: 2,051

|
Can anyone recommend a specific sitemap generator? I want to submit it to Google, but would feel better if someone pointed me in the right direction.
|
|
|
|
|
|
|
|
Sep 10 2008, 06:36 PM
|
Group: Verified NS Member
Posts: 225
Joined: 14-August 08
Member No.: 1,942

|
I used this free tool http://www.auditmypc.com/free-sitemap-generator.aspon recommendation of somebody here. works great, found a couple of broken links that I was able to fix quickly. The only issue I had is that it wouldn't export the xml file on my vista/ie7 machine. I had to export it to a different computer on my network and then upload it from there. Worked fine in the end. Took about a half hour to generate btw. let google know where it is, and yahoo too. Neither had any problems with it! good luck!
|
|
|
|
|
|
|
|
Sep 11 2008, 12:38 PM
|
Group: Verified NS Member
Posts: 210
Joined: 27-June 08
From: em thar hills in Virginia
Member No.: 1,342

|
You can use Gsitecrawler. Its free also. QUOTE (reneewood @ Sep 9 2008, 05:15 PM)  Can anyone recommend a specific sitemap generator? I want to submit it to Google, but would feel better if someone pointed me in the right direction.
|
|
|
|
|
|
|
|
Sep 11 2008, 12:55 PM
|
Group: Verified NS Member
Posts: 133
Joined: 2-June 08
Member No.: 1,218

|
QUOTE (Wackyjazz @ Sep 11 2008, 12:56 PM)  You can use Gsitecrawler. Its free also. How do you upload the icon that is viewable in your URL?
|
|
|
|
|
|
|
|
Sep 11 2008, 02:45 PM
|
Group: Verified NS Member
Posts: 228
Joined: 3-September 08
Member No.: 2,275

|
QUOTE (americangamingsupply @ Sep 11 2008, 02:13 PM)  How do you upload the icon that is viewable in your URL? It is called "favicon" you should name it favicon.ico and upload it to your file manager root directory
|
|
|
|
|
|
|
|
Jun 17 2009, 12:57 PM
|
Group: Verified NS Member
Posts: 33
Joined: 25-September 08
Member No.: 2,541

|
I posted this in another forum, but I will post it here as well. I use: http://www.xml-sitemaps.com/From the results, I download 4 files: XML, ROR, TEXT, HTML I upload all 4 of these to the file manager. Before doing that, I delete from the file manager the previous 4 files that I had uploaded. I also delete the 4 files from my computer so that the new 4 files are not renamed. After I upload all 4 files to the file manager, I go to Google Webmaster Tools and I resubmit (upload) the XML sitemap. I do all of this every time I add products to my website. It also seems to trigger Google to re-crawl my website.
|
|
|
|
|
|
|
|
Jun 17 2009, 02:39 PM
|
Jedi Master
Group: Verified NS Member
Posts: 1,142
Joined: 10-August 07
From: Galaxy Far, Far Away...
Member No.: 13

|
Thanks gymbratleotards for a very detailed explanation. I wanted to chime in here with an additional detail that our software creates an automated sitemap file that you can use to submit to Google Webmaster Tools. It's in your root directory - http://www.example.com/xml-sitemap.ashxThis is an XML formatted sitemap. You can submit multiple sitemaps to Google for the same site. As an aside, I'm curious if anyone uses their sitemaps on Yahoo with - http://siteexplorer.search.yahoo.com
|
|
|
|
|
|
|
|
Jun 17 2009, 03:20 PM
|
Group: Verified NS Member
Posts: 133
Joined: 2-June 08
Member No.: 1,218

|
QUOTE (ArcoJedi @ Jun 17 2009, 02:57 PM)  Thanks gymbratleotards for a very detailed explanation. I wanted to chime in here with an additional detail that our software creates an automated sitemap file that you can use to submit to Google Webmaster Tools. It's in your root directory - http://www.example.com/xml-sitemap.ashxThis is an XML formatted sitemap. You can submit multiple sitemaps to Google for the same site. As an aside, I'm curious if anyone uses their sitemaps on Yahoo with - http://siteexplorer.search.yahoo.com/xml-sitemap.ashx does it make changes when we make changes to website? How does it work?
|
|
|
|
|
|
|
|
Jun 17 2009, 03:24 PM
|
QA
Group: Administrators
Posts: 1,864
Joined: 10-August 07
Member No.: 6

|
QUOTE (americangamingsupply @ Jun 17 2009, 03:38 PM)  /xml-sitemap.ashx does it make changes when we make changes to website? How does it work? It works automatically, and updates when you update anything.
|
|
|
|
|
|
|
|
Jun 17 2009, 04:09 PM
|
Group: Verified NS Member
Posts: 238
Joined: 24-October 08
From: Pittsburgh, PA
Member No.: 2,842

|
QUOTE (ddavisNS @ Jun 17 2009, 04:42 PM)  It works automatically, and updates when you update anything. I don't see this specific file in my root directly. The sitemap I generated is in my root, but not one with an ashx. Why don't I have one and how can I get one? Thank you
|
|
|
|
|
|
|
|
Jun 17 2009, 04:19 PM
|
QA
Group: Administrators
Posts: 1,864
Joined: 10-August 07
Member No.: 6

|
QUOTE (lynn44 @ Jun 17 2009, 04:27 PM)  I don't see this specific file in my root directly. The sitemap I generated is in my root, but not one with an ashx. Why don't I have one and how can I get one?
Thank you It is system generated and not directly accessible. You can't edit it, configure it, or look at it in the file manager. It is accessed at www.yourdomain.com/xml-sitemap.ashx All sites on version 7.x have it.
|
|
|
|
|
|
|
|
Jun 17 2009, 06:07 PM
|
Group: Verified NS Member
Posts: 210
Joined: 27-June 08
From: em thar hills in Virginia
Member No.: 1,342

|
Hi Dave, You might want to take a peek at my .ashx sitemap. This auto sitemap has only a fraction of the url's listed. Example: I have 817 products, yet the .ashx sitemap shows just 628 urls. Why is there such a huge differance in numbers? QUOTE (ddavisNS @ Jun 17 2009, 04:42 PM)  It works automatically, and updates when you update anything.
|
|
|
|
|
|
|
|
Jun 17 2009, 08:55 PM
|
QA
Group: Administrators
Posts: 1,864
Joined: 10-August 07
Member No.: 6

|
QUOTE (Wackyjazz @ Jun 17 2009, 06:25 PM)  You might want to take a peek at my .ashx sitemap. This auto sitemap has only a fraction of the url's listed. Example: I have 817 products, yet the .ashx sitemap shows just 628 urls. Why is there such a huge differance in numbers? I took a peek at that from this thread: http://forums.networksolutions.com/e-comme...ashx-t4487.htmland was looking into it but lost track. I'll look into it more. There are URLs that are excluded from the sitemap for various reasons, but I don't have the list and the reasons for exclusion at this time. I emailed myself the other post as a reminder.
|
|
|
|
|
|
|
|
Jun 18 2009, 05:34 AM
|
Group: Verified NS Member
Posts: 210
Joined: 27-June 08
From: em thar hills in Virginia
Member No.: 1,342

|
Thanks, I searched for that post but for some reason couldn't locate it. I am showing over 900 URLs for the sitemap that I created using the gsitecrawler. I manually scroll through the gsite sitemap to ensure there are no links that are not products, categories or seo pages. I do not use urls such as search by price, email friend, etc. One other thing, we have loaded 10 or so new items in the past week, yet I dont see the number of urls on the .ashx sitemap increasing. QUOTE (ddavisNS @ Jun 17 2009, 10:13 PM)  I took a peek at that from this thread: http://forums.networksolutions.com/e-comme...ashx-t4487.htmland was looking into it but lost track. I'll look into it more. There are URLs that are excluded from the sitemap for various reasons, but I don't have the list and the reasons for exclusion at this time. I emailed myself the other post as a reminder.
|
|
|
|
|
|
|
|
Jun 18 2009, 11:24 AM
|
Group: Verified NS Member
Posts: 287
Joined: 13-August 08
Member No.: 1,898

|
QUOTE (Wackyjazz @ Jun 17 2009, 07:25 PM)  Hi Dave,
You might want to take a peek at my .ashx sitemap. This auto sitemap has only a fraction of the url's listed. Example: I have 817 products, yet the .ashx sitemap shows just 628 urls. Why is there such a huge differance in numbers? I use Gsitecrawler as well. I have about 1500 products but when I run gsitecrawler I get approximately 8500 URL's? I guess it includes every image URL as well as some html url's from File manager that came over from the old Monster Commerce version. I took a look at the auto generated ashx sitemap, how do you know how many URL's there are when looking at that report? Also, someone said earlier in this thread that you can submit multiple sitemaps to google webmaster tools for the same website? is this true? Msuz.
|
|
|
|
|
|
|
|
Jun 18 2009, 11:36 AM
|
QA
Group: Administrators
Posts: 1,864
Joined: 10-August 07
Member No.: 6

|
QUOTE (msuz @ Jun 18 2009, 11:42 AM)  I took a look at the auto generated ashx sitemap, how do you know how many URL's there are when looking at that report? You submit it to google and it tells you the count QUOTE (msuz @ Jun 18 2009, 11:42 AM)  Also, someone said earlier in this thread that you can submit multiple sitemaps to google webmaster tools for the same website? is this true? Yes, google only allows 10mb sitemaps, but it allows you to have multiple 10mb sitemaps. So if your autogenerated sitemap comes out to be over 10mb you can't use it as we don't currently split it into 10mb chunks.
|
|
|
|
|
|
|
|
Jun 18 2009, 12:21 PM
|
Group: Verified NS Member
Posts: 210
Joined: 27-June 08
From: em thar hills in Virginia
Member No.: 1,342

|
Msuz.. I dont use the picture urls in my sitemap... Just wondering are you importing your robot.txt file to the crawler? QUOTE (msuz @ Jun 18 2009, 12:42 PM)  I use Gsitecrawler as well. I have about 1500 products but when I run gsitecrawler I get approximately 8500 URL's? I guess it includes every image URL as well as some html url's from File manager that came over from the old Monster Commerce version.
I took a look at the auto generated ashx sitemap, how do you know how many URL's there are when looking at that report?
Also, someone said earlier in this thread that you can submit multiple sitemaps to google webmaster tools for the same website? is this true?
Msuz.
|
|
|
|
|
|
|
|
Jun 18 2009, 12:33 PM
|
Group: Verified NS Member
Posts: 287
Joined: 13-August 08
Member No.: 1,898

|
QUOTE (Wackyjazz @ Jun 18 2009, 01:39 PM)  Msuz..
I dont use the picture urls in my sitemap...
Just wondering are you importing your robot.txt file to the crawler? Wacky, Yes I am using the robot.txt file.....which is weird. Maybe we should compare our robots files? In fact, I have to eliminate several thousand URL on top of that, I get the ?previous and ?next url's as well as various $$$ over/ under URL's....... Msuz.
|
|
|
|
|
|
|
|
Jun 18 2009, 01:11 PM
|
Group: Verified NS Member
Posts: 210
Joined: 27-June 08
From: em thar hills in Virginia
Member No.: 1,342

|
I have found that sometimes you have to reset the robot.txt file in the crawler. You and find the button to do this at the bottom of the page where it list your robots.txt file. Sometimes you will get an error message that the crawler is busy, when its not. Stop the crawler, delete any current crawl info, import the robot.txt file and re-crawl.. Feel free to check out/copy my robot.txt file.... Have you had any luck with the images in your sitemap? QUOTE (msuz @ Jun 18 2009, 01:51 PM)  Wacky,
Yes I am using the robot.txt file.....which is weird. Maybe we should compare our robots files?
In fact, I have to eliminate several thousand URL on top of that, I get the ?previous and ?next url's as well as various $$$ over/ under URL's.......
Msuz.
|
|
|
|
|
|
|
|
Jun 18 2009, 05:04 PM
|
Group: Verified NS Member
Posts: 287
Joined: 13-August 08
Member No.: 1,898

|
QUOTE (Wackyjazz @ Jun 18 2009, 02:29 PM)  I have found that sometimes you have to reset the robot.txt file in the crawler. You and find the button to do this at the bottom of the page where it list your robots.txt file. Sometimes you will get an error message that the crawler is busy, when its not. Stop the crawler, delete any current crawl info, import the robot.txt file and re-crawl..
Feel free to check out/copy my robot.txt file....
Have you had any luck with the images in your sitemap? Wacky, I checked your robots file to mine and found the foll: THINGS YOU HAVE AND I DON'T: Disallow: /help.aspx Disallow: /login.aspx Disallow: /search.aspx (you have it twice, I have it once) You also have a link to your sitemap.xml THINGS I HAVE THAT YOU DON'T: Disallow: /Admin/ Disallow: /affiliatewiz/ Disallow: /eproducts/ Disallow: /themes/ Disallow: /custom.css.aspx Disallow: /images/view.aspx Let me know your thoughts? Maybe someone at netsol can comment on the above, as to what we should vs shouldn't have? I personally have never edited the file, so it's exactly as netsol setup... Thanks Msuz.
|
|
|
|
|
|
|
|
Jun 19 2009, 10:36 AM
|
Group: Verified NS Member
Posts: 210
Joined: 27-June 08
From: em thar hills in Virginia
Member No.: 1,342

|
I think I need to add some of the items that you have listed since the affiliatewiz is new. I guess every site is different in what the owner wants indexed vs not indexed. I had to put the search.aspx in twice since the bots would not or did not follow the command and I was getting search pages indexed. QUOTE (msuz @ Jun 18 2009, 06:22 PM)  Wacky,
I checked your robots file to mine and found the foll:
THINGS YOU HAVE AND I DON'T:
Disallow: /help.aspx Disallow: /login.aspx Disallow: /search.aspx (you have it twice, I have it once)
You also have a link to your sitemap.xml
THINGS I HAVE THAT YOU DON'T:
Disallow: /Admin/ Disallow: /affiliatewiz/ Disallow: /eproducts/ Disallow: /themes/
Disallow: /custom.css.aspx Disallow: /images/view.aspx
Let me know your thoughts? Maybe someone at netsol can comment on the above, as to what we should vs shouldn't have? I personally have never edited the file, so it's exactly as netsol setup...
Thanks Msuz.
|
|
|
|
|
|
|
|
Jun 19 2009, 10:43 AM
|
QA
Group: Administrators
Posts: 1,864
Joined: 10-August 07
Member No.: 6

|
QUOTE (msuz @ Jun 18 2009, 05:22 PM)  Wacky,
I checked your robots file to mine and found the foll:
THINGS YOU HAVE AND I DON'T:
Disallow: /help.aspx Disallow: /login.aspx Disallow: /search.aspx (you have it twice, I have it once)
You also have a link to your sitemap.xml
THINGS I HAVE THAT YOU DON'T:
Disallow: /Admin/ Disallow: /affiliatewiz/ Disallow: /eproducts/ Disallow: /themes/
Disallow: /custom.css.aspx Disallow: /images/view.aspx
Let me know your thoughts? Maybe someone at netsol can comment on the above, as to what we should vs shouldn't have? I personally have never edited the file, so it's exactly as netsol setup...
Thanks Msuz. admin affiliatewiz and eproducts were migrated from your 4x site. They can be removed as they don't exist on the 7x site, but they aren't hurting anything because they don't exist on the 7x site. Nothing you have will really hurt anything as nothing you are blocking has much seo value and some things you are blocking don't exist anyway. Search.aspx we added as we had some customers who were linking to searches on every product page and it was causing their site performance to degrade when indexed by google as it would do thousands of searches at once, so I would suggest leaving that blocked unless you know you aren't linking to searches anywhere.
|
|
|
|
|
|
|
|
Jun 19 2009, 11:06 AM
|
Group: Verified NS Member
Posts: 287
Joined: 13-August 08
Member No.: 1,898

|
QUOTE (ddavisNS @ Jun 19 2009, 12:01 PM)  admin affiliatewiz and eproducts were migrated from your 4x site. They can be removed as they don't exist on the 7x site, but they aren't hurting anything because they don't exist on the 7x site. Nothing you have will really hurt anything as nothing you are blocking has much seo value and some things you are blocking don't exist anyway. Search.aspx we added as we had some customers who were linking to searches on every product page and it was causing their site performance to degrade when indexed by google as it would do thousands of searches at once, so I would suggest leaving that blocked unless you know you aren't linking to searches anywhere. Dave, Taking my robots file into account, gsite crawler indexes the following URL types that cause me to have 1000's of URLS in my sitemap, including the following types: http://www.nameofmysite.com/search.aspx?ma...amp;category=10 (I get tons of these low-high urls) http://www.nameofmysite.com/search.aspx?ma...2&log=false(still getting these search ones? even thought I have Disallow Search in my robots file) http://www.nameofmysite.com/productorcateg...ame.aspx?page=3 ( I get every page 2, 3, 4, 5 etc....) Also getting every product three times, because of the ?next and ?previous page links.... Can you advise on some things to add to the robots file to cut this down? Also do I need to put Disallow: /Search in twice as it seems to still pickup some search.aspx Thanks Msuz.
|
|
|
|
|
|
|
|
Jun 19 2009, 11:11 AM
|
QA
Group: Administrators
Posts: 1,864
Joined: 10-August 07
Member No.: 6

|
QUOTE (msuz @ Jun 19 2009, 11:24 AM)  Dave, Taking my robots file into account, gsite crawler indexes the following URL types that cause me to have 1000's of URLS in my sitemap, including the following types: http://www.nameofmysite.com/search.aspx?ma...amp;category=10 (I get tons of these low-high urls) http://www.nameofmysite.com/search.aspx?ma...2&log=false(still getting these search ones? even thought I have Disallow Search in my robots file) http://www.nameofmysite.com/productorcateg...ame.aspx?page=3 ( I get every page 2, 3, 4, 5 etc....) Also getting every product three times, because of the ?next and ?previous page links.... Can you advise on some things to add to the robots file to cut this down? Also do I need to put Disallow: /Search in twice as it seems to still pickup some search.aspx Thanks Msuz. I've never had much luck with gsitecrawler. That being said, there is a disallow section in gsitecrawler, you should probably use that instead of relying on it being able to parse robots.txt. Disallowing something multiple times in robots.txt has the same effect as disallowing it once.
|
|
|
|
|
|
|
|
Jun 19 2009, 12:41 PM
|
Group: Verified NS Member
Posts: 133
Joined: 2-June 08
Member No.: 1,218

|
Can we use the same generator for yahoo as well? xml-sitemap.ashx
|
|
|
|
|
|
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:
|