User:NeilK/Sitemaps: Difference between revisions

From Wikimedia Foundation Governance Wiki
Content deleted Content added
NeilK (talk | contribs)
NeilK (talk | contribs)
No edit summary
Line 1: Line 1:
Google Image Search asks about Sitemaps. meeting in march.
Google Image Search asks about Sitemaps. meeting on Feb 28.


== Resources ==
== Resources ==
Line 22: Line 22:


It is unclear why we stopped. Brion believes that Jens Frank (JeLuf on IRC) was the one in charge of this. Emailing him to find out.
It is unclear why we stopped. Brion believes that Jens Frank (JeLuf on IRC) was the one in charge of this. Emailing him to find out.

== Ideas ==

Considering that dumps are now regular again (?) perhaps this should be related.

Or not? It is also sometimes important to be timely. It would be nice if we had a script to run daily to append new articles to the last leaf of a tree of sitemap index files (or entire new leaves as appropriate). Then we can regenerate the entire tree now and then to get rid of deleted pages.

Revision as of 20:51, 25 February 2011

Google Image Search asks about Sitemaps. meeting on Feb 28.

Resources

http://www.mediawiki.org/wiki/Manual:GenerateSitemap.php

The manual page indicates these are not compatible with Google as of 1.16, but a patch can fix that. Did this get fixed in 1.17? http://www.mediawiki.org/wiki/Manual_talk:GenerateSitemap.php#A_BUG_FIX_to_work_the_Google_Webmaster_Tools

There are a number of other tools: http://www.mediawiki.org/wiki/Extension:Google_Sitemap (may be obsolete)

This user created another script... it is unclear why he thought it was necessary to write his own, perhaps this works better with multiple sites. http://www.mediawiki.org/wiki/User:DaSch/generateSitemap.php

History

Consensus from Brion Vibber, Ariel T. Glenn, etc, is that we used to run Sitemaps but haven't since 2008. https://bugzilla.wikimedia.org/show_bug.cgi?id=13693 suggests the exact date was 2007-12-27.

Brion believes the standard generateSitemap.php script was the one being used.

It is unclear why we stopped. Brion believes that Jens Frank (JeLuf on IRC) was the one in charge of this. Emailing him to find out.

Ideas

Considering that dumps are now regular again (?) perhaps this should be related.

Or not? It is also sometimes important to be timely. It would be nice if we had a script to run daily to append new articles to the last leaf of a tree of sitemap index files (or entire new leaves as appropriate). Then we can regenerate the entire tree now and then to get rid of deleted pages.