Policy talk:User-Agent policy: Difference between revisions

From Wikimedia Foundation Governance Wiki
Latest comment: 2 years ago by Enterprisey in topic Refresh needed
Content deleted Content added
→‎WikimediaBot: new section
m Bot: Archiving 2 threads (older than 180 days) to Policy talk:User-Agent policy/Archive 4
 
(34 intermediate revisions by 15 users not shown)
Line 1: Line 1:
{{talk}}
I'm trying to run a script to collect some data from Wikipedia for the first time and I've run into this measure. Is there any way I could see the format of the user_agent string that Wikimedia expects so as to properly format my settings? I tried a vernacular description, but still ended up with the "give a descriptive user agent" response. [[User:Elijahmeeks|Elijahmeeks]] 02:05, 11 April 2011 (UTC)
{{User:LincolnBot/archiveconfig
:Never mind, I didn't realize I had to set it in curl instead of as the usual initialization.[[User:Elijahmeeks|Elijahmeeks]] 02:26, 11 April 2011 (UTC)
|archive = Policy talk:User-Agent policy/Archive %(counter)d
|algo = old(180d)
|counter = 4
|maxarchivesize = 150K
|archiveheader = {{talk archive}}
|minthreadstoarchive = 1
|minthreadsleft = 3
}}


== More strict contact information ==
== Posting ==


As [[User:Mforns (WMF)]] has mentioned above, the existing policy wrt providing contact information is loose. Working together with the ops and client community we could improve that part of the policy and perform audits on client compliance assisted by analytics data identifying the main non-compliant clients that need to be upgraded.
Strangely enough, I get the message about a temporary server problem unless I take out this line:
<source lang="php">
curl_setopt($ch,CURLOPT_POST,TRUE);
</source>
Not sure why that might be. [[User:Leucosticte|Leucosticte]] ([[User talk:Leucosticte|talk]]) 09:26, 4 August 2012 (UTC)


For example, we could change the 'e.g.' to be a fixed list of only three options:
"temporary server problem" is strange indeed, but i bet it is "you're sending a POST request when the server expect a GET request". [[User:Divinity76|Divinity76]] ([[User talk:Divinity76|talk]]) 12:43, 12 March 2013 (UTC)
# email,
# URL with an email address prominently placed on the page,
# interwiki syntax enclosed in [[ ]] where the first prefix in the interwiki is a valid interwiki prefix present on [[Interwiki Map]].
## If it is a user page, [[special:emailuser]] must be enabled for the linked user.
## Otherwise an email address must be prominently placed on the page.


[[User:John Vandenberg|John Vandenberg]] ([[User talk:John Vandenberg|talk]]) 14:43, 21 March 2016 (UTC)
== Does my bot need a bot user agent? ==


== I cant download images with a .net console application ==
I'm working on an irc bot. I want to make it have a command !wikipedia <something>. The command will print the first 300 or so letters of the wikipedia article on <something>. As it will just read data from pages, do I need a bot user agent?
Thanks.
[[Special:Contributions/76.246.45.128|76.246.45.128]] 02:36, 28 March 2013 (UTC)
: Yes. [[User:Anomie|Anomie]] ([[User talk:Anomie|talk]]) 12:39, 28 March 2013 (UTC)
:: Ok, thanks. It has a bot user agent now :) [[Special:Contributions/76.246.52.64|76.246.52.64]] 03:55, 4 April 2013 (UTC)


I am trying to download about 250 images to my pc with a script. I am waiting 5 seconds in between each image, so as to not use too much bandwidth. I have followed all the user agent suggestions as closely as I can.
== User-Agent restriction or lack thereof ==
this is my user agent string "bot by <My Name> (<My email address>) I am downloading an image of each carnivoran once (less than 300 images) for educational purposes"
You say, make it descriptive, include the word bot if its a bot, include contact details in parentheses, all of which I have done.
But I still get this error after downloading 3 images
System.Net.WebException: 'The remote server returned an error: (403) Forbidden. Please comply with the User-Agent policy: https://meta.wikimedia.org/wiki/User-Agent_policy. '
I have posted about this on stack overflow, you can see the code in question.


https://stackoverflow.com/questions/69230731/wikipedia-user-agent-problem-when-downloading-images
According to the article, the various MediaWiki sites are supposed to generate errors given an empty/missing/blacklisted User-Agent string. Is this supposed to be only on editing or all the time? Skimming the old discussion, it sounds like all requests should fail, but I just tried sending various read-only requests to multiple MW sites with no/empty/"lwp bla" User-Agents via both index.php and api.php and they all came back fine. What gives? [[User:RobinHood70|RobinHood70]] ([[User talk:RobinHood70|talk]]) 07:42, 26 August 2014 (UTC)
:MediaWiki? This is described as a [[Wikimedia projects|Wikimedia]] policy. Also, it says "may"; there is probably a soft enforcement. --[[User:Nemo_bis|Nemo]] 10:18, 26 August 2014 (UTC)
::Oops, forgot to monitor this page and only noticed your reply now. I had meant Wikimedia, not MediaWiki. My bad. I hadn't thought of a soft enforcement scenario. Thanks for that. [[User:RobinHood70|RobinHood70]] ([[User talk:RobinHood70|talk]]) 15:28, 3 December 2014 (UTC)


{{unsigned|89.106.141.56|11:47, 18 September 2021 (UTC)}}
== WikimediaBot ==


== Refresh needed ==
There is a proposal by the Analytics team to require that 'bots' add 'WikimediaBot' to their user-agent even for reads. See https://lists.wikimedia.org/pipermail/analytics/2016-January/004858.html [[User:John Vandenberg|John Vandenberg]] ([[User talk:John Vandenberg|talk]]) 10:32, 28 January 2016 (UTC)

This page needs a refresh, better thematic segmentation. I came with the will to edit it but the translates tags makes me reluctant to edit. If you have experience in those and want to team up to revamp this page, ping me. [[User:Yug|Yug]] ([[User talk:Yug|talk]]) 22:33, 28 December 2021 (UTC)

:Done, sorta. [[User:Enterprisey|Enterprisey]] ([[User talk:Enterprisey|talk]]) 22:45, 3 March 2022 (UTC)

Latest revision as of 21:11, 29 March 2024

More strict contact information

As User:Mforns (WMF) has mentioned above, the existing policy wrt providing contact information is loose. Working together with the ops and client community we could improve that part of the policy and perform audits on client compliance assisted by analytics data identifying the main non-compliant clients that need to be upgraded.

For example, we could change the 'e.g.' to be a fixed list of only three options:

  1. email,
  2. URL with an email address prominently placed on the page,
  3. interwiki syntax enclosed in [[ ]] where the first prefix in the interwiki is a valid interwiki prefix present on Interwiki Map.
    1. If it is a user page, special:emailuser must be enabled for the linked user.
    2. Otherwise an email address must be prominently placed on the page.

John Vandenberg (talk) 14:43, 21 March 2016 (UTC)Reply

I cant download images with a .net console application

I am trying to download about 250 images to my pc with a script. I am waiting 5 seconds in between each image, so as to not use too much bandwidth. I have followed all the user agent suggestions as closely as I can. this is my user agent string "bot by <My Name> (<My email address>) I am downloading an image of each carnivoran once (less than 300 images) for educational purposes" You say, make it descriptive, include the word bot if its a bot, include contact details in parentheses, all of which I have done. But I still get this error after downloading 3 images System.Net.WebException: 'The remote server returned an error: (403) Forbidden. Please comply with the User-Agent policy: https://meta.wikimedia.org/wiki/User-Agent_policy. ' I have posted about this on stack overflow, you can see the code in question.

https://stackoverflow.com/questions/69230731/wikipedia-user-agent-problem-when-downloading-images

—Preceding unsigned comment added by 89.106.141.56 (talkcontribs) 11:47, 18 September 2021 (UTC)Reply

Refresh needed

This page needs a refresh, better thematic segmentation. I came with the will to edit it but the translates tags makes me reluctant to edit. If you have experience in those and want to team up to revamp this page, ping me. Yug (talk) 22:33, 28 December 2021 (UTC)Reply

Done, sorta. Enterprisey (talk) 22:45, 3 March 2022 (UTC)Reply