Wikimedia Foundation Data Retention Guidelines

From Wikimedia Foundation Governance Wiki

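Introduction

Data is important. It is one of the ways we learn and grow as an organization and as a movement, and how we help the projects better serve those who use them to create, learn, and share. At the same time, we are committed to keeping your personal data "for the shortest possible time that is consistent with the maintenance, understanding, and improvement of the Wikimedia Sites, and our obligations under applicable U.S. law" (quoted from the Wikimedia Foundation Privacy Policy).

This document helps explain how we fulfill that commitment by describing our guidelines for data retention, system design, and ongoing review and maintenance. These guidelines are intended to be a living document: they will be updated over time to reflect our current retention practices.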

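What data is covered by these guidelines?

These guidelines apply to all non-public data we collect from the Wikimedia Sites covered by our Privacy Policy and Non-Wiki Privacy Policy. Our Donor Privacy Policy includes separate data retention guidelines that apply to donor information.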

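How long do we retain non-public data?

Unless a third party requires otherwise or force majeure applies, we retain data for no longer than the periods defined in the following table: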

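Data type | Source | Examples | Maximum retention period
Non-public personal information | Collected automatically from users
  • IP addresses of site visitors (operational data)
  • IP addresses of A/B test subjects (analytic data)
  • Identifying user-agent information of site visitors
After at most 90 days, it will be deleted, aggregated, or de-identified
Account settings
  • Email address
Until the user deletes or changes the account setting
Non-personal information | Collected automatically from users | Indefinitely
After at most 90 days, it will be deleted, aggregated, or de-identified
Provided by users
  • Logs of terms entered into a site's search box, or terms in pre-filled search engine links that a user follows
After at most 90 days, it will be deleted, aggregated, or de-identified
Provided by users
  • Language
Until the user deletes or changes the account setting
Non-personal information[T 1] | Collected automatically from all types of users | Indefinitely
Articles viewed by readers | Collected automatically from readers
  • List of articles visited by a reader
After at most 90 days, if retained at all, then only in aggregate form.
  1. For the purposes of this table, "user account" means a username, user ID, or IP address; "reader" means a visitor to the Wikimedia projects.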
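
As a rough illustration of how the 90-day rows in the table above might be checked in practice, the sketch below flags records that have exceeded their maximum retention period. The category names, the `MAX_RETENTION` mapping, and the `is_past_retention` function are hypothetical and are not part of any Wikimedia system; only the 90-day limit and the "until the user deletes or changes the setting" rule come from the table.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# Hypothetical category names. A value of None means the data has no fixed
# time limit and is kept until the user deletes or changes the setting.
MAX_RETENTION: dict[str, Optional[timedelta]] = {
    "ip_address": timedelta(days=90),
    "user_agent": timedelta(days=90),
    "search_terms": timedelta(days=90),
    "reader_article_views": timedelta(days=90),
    "email_address": None,
    "language_setting": None,
}


def is_past_retention(category: str, collected_at: datetime, now: datetime) -> bool:
    """Return True if a record is older than its maximum retention period.

    A True result means the record should be deleted, aggregated, or
    de-identified; this sketch does not perform any of those actions.
    """
    limit = MAX_RETENTION[category]
    if limit is None:
        return False  # retained until the user removes or changes it
    return now - collected_at > limit


now = datetime(2024, 1, 1, tzinfo=timezone.utc)
old_record = now - timedelta(days=91)
print(is_past_retention("ip_address", old_record, now))
```

Treating the limit as a per-category lookup keeps the policy in one place, so a change to a retention period would only need to be made in the mapping.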

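How long do we retain public data?

Wikimedia hosts Wikipedia and related projects as part of our mission to collect, document, and freely distribute the sum of human knowledge. Consequently, when you make a contribution to any Wikimedia Site, including on user or discussion pages, you are creating a permanent, public record of every piece of content you add, remove, or alter. The page history will show when your contribution or deletion was made, as well as your username (if you are logged in) or your IP address (if you are not logged in). We may use your public contributions, either aggregated with the public contributions of others or individually, to create new features or data-related products for you, or to learn more about how the Wikimedia Sites are used. If you have mistakenly included your personal information in a contribution to a Wikimedia Site and would like it removed, please consult the community's oversight policy. Keep in mind that the transparency and integrity of our sites' revision history is essential to our mission, and the Foundation supports our communities' right to decline oversight requests in order to protect the projects.

If you choose to register an account on a Wikimedia project, you will be asked to choose a username. The username is retained until the user requests that the account be renamed, or until the account goes through a community vanishing process.

Please see our Privacy Policy for more information.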

Definitions

For the purposes of these guidelines:

  • "Personal information" means information you provide us or information we collect from you that identifies or could be used to personally identify you. For details, please see the Wikimedia Foundation Privacy Policy and Non-Wiki Privacy Policy.
  • Some examples of "public information" would include:
    • (a) your IP address, if you edit without logging in;
    • (b) your gender, if it is disclosed under your user profile;
    • (c) any personal information you disclose publicly on the Wikimedia Sites, such as your real name or age.
  • Some examples of types of information that are considered to be "nonpublic information" include:
    • (a) your IP address, if you edit while logged in;
    • (b) your email address, if you provided one to us during account registration (but did not post it publicly); and
    • (c) your general location information as might be derived from your IP address, if you have not posted it publicly. The types of information that are considered "nonpublic" as opposed to "public" are more fully explained in our Privacy Policy.
  • Data is "de-identified" when it has been aggregated or otherwise retained in a manner such that it can no longer be used to identify the user.
  • Data is "aggregated" when the data associated with a specific user has been combined with data from others to show general trends or values without identifying specific users.

An example of how data may be aggregated:

Using ranges rather than specific numbers, such as recording that there are "between 1 and 10 editors in language X in country Y" rather than recording that there are 4 editors.
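
The range-based approach described above can be sketched as a small bucketing function. This is a minimal illustration only: the function name `to_range` and the default bucket width of 10 are assumptions made for the example, not a description of how the Foundation actually aggregates data.

```python
def to_range(count: int, width: int = 10) -> str:
    """Replace an exact count with the range (bucket) that contains it.

    Buckets are 1..width, width+1..2*width, and so on. Zero is reported
    as "0" because it falls outside every bucket.
    """
    if count < 0:
        raise ValueError("count must be non-negative")
    if width < 1:
        raise ValueError("width must be at least 1")
    if count == 0:
        return "0"
    low = ((count - 1) // width) * width + 1
    high = low + width - 1
    return f"between {low} and {high}"


# Hypothetical exact value that would not be recorded as-is.
print(to_range(4))
```

Recording only the bucket label means the stored value no longer reveals the exact number of editors, which is the point of the example in the text.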

Terms that are not defined in this document have the same meaning given to them in the Privacy Policy.

Exceptions to these guidelines

If we make exceptions to these guidelines, we will notify the community by describing the exception on this page.

  • Data may be retained in system backups for longer periods, but for no more than 5 years.
  • When we conduct a survey or other research, we will provide you with a privacy statement specifying the term of retention for information (including personal information) collected through your participation in such research. In certain cases, information may be retained indefinitely for educational, development, or other related purposes, unless otherwise indicated in the relevant privacy statement. Such information may be retained in raw, aggregated, or de-identified form until we receive a request from the participant to delete the information.
  • Research related to COVID-19: The Wikimedia Foundation Research team is conducting research regarding COVID-19 and its impact on Wikipedia. Retaining de-identified readership data from COVID-19 related articles will enable us to better understand how to prioritize content creation, to understand what happens to readership when there is a "shock to the system", and to empower the research community to answer such questions. By "COVID-19 related articles", we mean articles that link to the COVID-19, SARS-CoV-2 and 2019-2020 COVID-19 pandemic Wikidata items. For comparison purposes, we will retain data from a small number of articles unrelated to COVID-19 as well. In order to collect sufficient data, and obtain a picture of readership as time passes, we will be retaining this de-identified data beyond the 90-day retention limit, for a period of one year, ending on March 1, 2021. (Note that this includes a one-month extension due to staffing changes, in order to allow for the project's completion.) For technical details about the sampling and de-identification process, please see the project page on GitHub.
  • Editing research: There is a short-term extension applying to data collected as part of experimental features to improve replying on talk pages. In order to collect and analyze sufficient data, this data must be kept beyond the standard 90-day period. The retained data will be deleted, aggregated, or de-identified within 180 days.
  • Campaign landing pages: for certain events, campaigns, or marketing channels, users may create accounts on special landing pages. After creating their account on those pages, the association between their account and its source may be retained indefinitely, both to provide a good user experience for that account and for longitudinal analysis on campaign effectiveness. For more information, contact mmiller@wikimedia.org.
  • CampaignEvents extension: An exception exists for data collected by the CampaignEvents extension. The extension collects the global user IDs of event organizers and event participants, as well as which events users organized or attended and when participants registered for an event. In order for the extension features to work consistently, data collected by the CampaignEvents extension may be retained indefinitely.
  • Sound logo contest: There is a short-term extension applying to data collected as part of contest entries to allow the brand studios team to evaluate entries in preparation for announcing the winner in February 2023. The retained data will be deleted, aggregated or de-identified within 90 days after the winner is announced.
  • In rare cases, we, or particular users with certain administrative rights as described in our Privacy Policy, may need to retain your personal information, including your IP address and user agent information, for as long as reasonably necessary (which may be longer than the period described in the table above) to:
    • enforce or investigate potential violations of our Terms of Use, this Privacy Policy, or any Foundation or user community-based policies;
    • investigate and defend ourselves against legal threats or actions;
    • help protect against vandalism and abuse, fight harassment of other users, and generally try to minimize disruptive behavior on the Wikimedia Sites;
    • prevent imminent and serious bodily harm or death to a person, or to protect our organization, employees, contractors, users, or the public; or
    • detect, prevent, or otherwise assess and address potential spam, malware, fraud, abuse, unlawful activity, and security or technical concerns.

Audits and improvements

The Foundation is committed to continuous evaluation and improvement of these guidelines, and to periodic audits in order to identify such improvements. As we make changes to existing and new systems, we will update these guidelines to reflect our changing practices.

New system design

In order to support these data retention periods and our overall privacy policy, new tools and systems implemented by the Foundation will be designed with privacy in mind. This will include:

  • inclusion of these data retention guidelines as requirements during the design process;
  • legal consultation during the design and development process; and
  • inclusion of privacy considerations in the code review process.

Ongoing handling of new information

Despite our best efforts in designing and deploying new systems, we may occasionally record personal information in a way that does not comply with these guidelines. When we discover such an oversight, we will promptly comply with the guidelines by deleting, aggregating, or de-identifying the information as appropriate.

Contact us

If you think that these guidelines have potentially been breached, or if you have questions or comments about compliance with the guidelines, please contact us at privacy@wikimedia.org.
