Loading
  • 21 Aug, 2019

  • By, Wikipedia

Wikipedia:1.0 Team

schools selection
Offline WP for Indian Schools


CORE TOPICS
CORE SUPPLEMENT
Core topics - 1,000
(Talk) (COTF) (bot)
TORRENT (Talk) "Selection" project for kids ((t)) WORK VIA WIKI
PROJECTS
(talk)
Pushing to 1.0 (talk)
Static content subcom.

This is the Version 1.0 Editorial Team page.

General background

Article ratings assessment scheme

In late 2003, Wikipedia co-founder Jimmy Wales had proposed making an offline release version of Wikipedia. This group was formed in late 2004 to meet this challenge. Our work involves identifying and organizing articles, and improving and maintaining a core set. Our work does not hinder the existing wiki process for creating and editing articles, but rather it supports that work by providing additional organization. We aim to produce collections that can be used in places where the internet coverage is expensive or non-existent. Our early collections were distributed via DVD; now these are shared via download, then distributed on hardware such as a Raspberry Pi. Originally, only a fixed selection was available, but there is now much flexibility in how selections can be made. This project is now mainly one point in a network of groups who collect and distribute open educational resources from the Internet in an offline form.

See these more detailed related articles:

How you can help

You are encouraged to join us and help out with one of the projects, or to discuss Wikipedia 1.0 on the talk page. A significant part of our work centers around maintaining the assessment scheme, which is now used on more than seven million articles by over 1000 active WikiProjects on the English Wikipedia. It is also being used on other language projects. Generally work on this team is sporadic – periods of hectic activity followed by long periods of waiting! Often work is long and tedious – checking through a list of 22,000 instances of profanities one by one, organizing 10,000 keywords taken from category names, or dealing with technical bugs when the assessment bot fails for no apparent reason. However, it is all worth it in the end.

Our strategy has been intensely debated, but the group has reached a consensus. We elected not to follow the German model. Instead we chose to start with a core of quality articles on key subjects and expand from there. We have produced three test versions: Version 0.5, Version 0.7, and Version 0.8 with the goal of releasing better collections of articles in due course. The next general release is generically referred to as "Release Version" while our first "official" comprehensive release will be called Version 1.0. These collections are then made available for offline use using a reader such as Kiwix, which was chosen as Sourceforge project of the month. The project was on hiatus for several years because of the loss of our main developer. However, as of February 2016, a new group of developers has begun work on upgrading the code and the process, in order to start producing new collections again, especially collections for schools.

Current needs

Although we have much of the requisite system automated, there are still some outstanding tasks:

  • Preparation of a reliable index. If you can write code and you're interested in how to map category trees into a useful index (not as easy as it sounds!) please contact Walkerma.
  • Reviewing manual nominations. Whenever there is a new release being planned, we need volunteers to review a few articles and process them.
  • Propose useful "guide" pages to be added, such as lists and disambiguation pages.
  • Check for vandalism in the selected version-IDs of the articles.
  • Develop nice pages for navigation through the content, such as subject portals.
  • Test the reader software, and find and report bugs.
  • Help with distribution, especially in remote areas without Internet access.

Please let us know on the Talk page if you can help with any of these.

A page read offline in Kiwix

Status

At present, the main activities are:

  • The assessment scheme, which is used by WikiProjects for organizing their content, using talk page tags and the WP1.0 bot. The bot was updated with completely new Perl code in 2020–2021, and it is currently maintained by User:Audiodude. Technical problems with the bot should be reported here. Related to this work is the WP1.0 server (previously called "Release Version Tools) which provides ways for WikiProjects to analyze article lists and data relating to their work.
  • Collaborations to produce offline collections are done in collaboration with various people from Kiwix and Internet-in-a-Box. Please contact Walkerma if you wish to help.

To select articles, we are mainly using a bot-assisted selection process based on assessment by individual WikiProjects, where articles are selected automatically based on quality and importance project rankings.

RevID selection

Based on discussions (at the 2017 Potsdam hackathon and since), we plan to reactivate RevID selection. Previously code based on WikiTrust was used in Version 0.8, and this appeared to produce a largely vandalism-free collection of articles. This worked by scoring each RevID based on the edits remaining in it, and choosing the most "trustworthy" recent RevID based on the WikiTrust algorithm.

Wikipedia 1.0 projects

Active projects

If you would like to start a new project, please discuss it on the talk page first before adding it here.

Wikipedia 1.0 Projects
Name Summary of overall strategy Coordinator Description of activities
School selection Put together selections of 1–10 GB sizes for use in high schools and elementary schools User:Walkerma and others Uses new code that starts with a seed and works out, guided by the WP 1 selection ranking to guide it
Work via WikiProjects (WVWP) Use "networking" to mobilise our existing subject specialists User:Walkerma Organise and facilitate compilation of article lists from the WikiProjects and seek to identify important topics within each WikiProject's area of expertise. Locate important topics that are currently not being managed by projects. In conjunction with WP:COUNCIL, the project serves as a link with the editing community, and may later help locate expert reviewers.

Past releases

Past Wikipedia 1.0 Projects
Name Summary of overall strategy Month of release Description of activities Website Next release
Version 0.5 A test release prior to release of Version 1.0 above. April 2007 A test release designed to pave the way for Version 1.0. Used manual nominations and approval based on importance and quality. Approval was by only one person, from the review team. Okawix Version 0.7
Version 0.7 A test release of automated article selections, prior to release of Version 1.0 above. Early 2010 A test release designed to pave the way for Version 1.0. Used SelectionBot to make an article selection based on importance and quality. Vandalism prevention used a script, with manual checks, which delayed the release significantly. [1] Kiwix reader, ZIM download Version 0.8
Version 0.8 A test release of automated article selections, prior to release of Version 1.0 above. March 2011 A test release designed to pave the way for Version 1.0. Version 0.8 used bot-assisted article selection, with manual adjustments based on feedback from WikiProjects. Used as a test of the WikiTrust revisionID selection code - this worked well. Wikipedia:Version 0.8/downloads. Version 0.9
2006 Wikipedia CD Selection (previously called "Test Version") Work with release version done off site that was coordinated by BozMo April 2006 2000 articles with content filtered/selected for use by children (see Wikipedia:Wikipedia CD Selection). No longer available - see 2008/9 release below 2007 Wikipedia CD Selection (below)
2007 Wikipedia CD Selection Work with release version done off site that was coordinated by BozMo May 2007 4655 articles with content filtered/selected for use by children (see Wikipedia:Wikipedia CD Selection). No longer available - see 2008/9 release below 2008/9 Wikipedia CD Selection (below)
2008/9 Wikipedia CD Selection Work with release version done off site that was coordinated by BozMo October 2008 5502 articles with content filtered/selected for use by children (see Wikipedia:Wikipedia CD Selection). http://schools-wikipedia.org Not yet known

Inactive projects

Inactive Wikipedia 1.0 Projects
Name Summary of overall strategy Coordinator Description of activities
Authoritative Editions Gives experts in each field the tools to review suitable articles in their area of expertise and give their okay to particular revisions. Sj, Jeff Keller, Gnp
  1. Allow editors to propose new groups; for example: "Featured Article review" or "American Physical Society peer review".
  2. Each blessing group would have a set of review guidelines.
  3. Each blessing group would consist of this set of guidelines, and a set of users who could bless a revision of an article as satisfying them.
  4. Articles with blessed revisions would display icons or links to last-blessed revisions.
Featured Articles First reviewing older or problem FAs to ensure that quality is maintained, ready for inclusion of these articles in Wikipedia 1.0. (Now a standard part of the FA system)
Geography project to produce a descriptive gazetteer of the world for publication. This could include an atlas, continents, countries and major cities. This would serve as a test bed for publishing Wikipedia 1.0, but could also be a valuable stand-alone product.
Three Level Editing Users participate in a three part editing process to assure that pages are up to quality standards. The first level is just a general check, the second level is a factual check, and the third level is a last "just in case" check. This process would assure that articles would be up to standards without putting too much responsibility on one user.
Biographies Improve and assess biographical articles Focus especially on the 200 Core Biographies, in conjunction with Wikipedia:WikiProject Biography.

.

Dynamic Pocket Cyclopedia Lists most important FAs, FLs, and GAs. An evolving list of no more than half of Wikipedia's most important featured articles, featured lists, and designated good articles.
WikiSort Integrate the sifting process into the Wiki. Aims to use data from the planned user rating scheme to provide rankings of articles, such that important quality articles can be automatically identified for inclusion in Wikipedia 1.0. This project has been rendered obsolete by WikiProject-based assessments (see above).
Article assessment To assess articles This project has been rendered obsolete by WikiProject-based assessments (see above).

Publishing steps

The process of generating an offline version of a sub-selection of Wikipedia article is multistage. It needs many dedicated and singled-purposed operations. The following chart show how the WP1 project envisioned things in 2010.

The general process for producing an offline release

Even if this chart is still, to a large extend, valid; we practice and envision things slightly differently nowadays. One of the most important paradigm change we had to make is to remove as much as possible human based manual activity because the amount of work is simply too high to be achieved in a reasonable amount of time. We tend now to automate as much as possible the whole process. As a consequence the project is now predominantly focused on technology.

Technical approach

Support Wikiproject assessment effort

The first software created to support the WP1 project has been the User:WP_1.0_bot. First written in Perl by User:CBM and then slighly modified and maintained by a few other volunteers. In 2020 the bot has been totally rewritten in Python following modern development standards (API, automated tests, etc.) by User:Audiodude. The code base is available en developed on Github.

The WP1bot had and still have three traditional purposes:

  • gather assessments (via categories introduced on main namespace articles talk pages),
  • upload on Wikipedia logs & stats
  • provide key information & tools to Wikiproject on a dedicated Web service. The data can also be accessed through an API at api.wp1.openzim.org

Select article titles

...

Select article revision

...

Scrape selected articles for offline usage

...

Orchestrate periodic and multiple scraping

...

Publish and distribute offline snapshots

...

Statistics

The WP 1.0 bot tracks assessment data (article quality and importance data for individual WikiProjects) assigned via Talk page banners. If you would like to add a new WikiProject to the bot's list, please read the instructions at Wikipedia:Version 1.0 Editorial Team/Using the bot.

The global summary table below is computed by taking the highest quality and importance rating for each assessed article in the main namespace.

General

Assessment and validation

Wikipedia books

  • Wikipedia:Books & meta:WikiReader - Wikipedia books are collections of articles from Wikipedia on a certain topic, in the form of PDFs published for download and intended to be printed, and also to be sold in printed form.
  • The Book Tool, and Wiki to print, a collaboration between the Foundation and OSI/PediaPress.

Article selections

See also