BitTorrent/fr: Difference between revisions

From Shareaza Wiki
Jump to navigation Jump to search
Line 27: Line 27:


==Operation==
==Operation==
[[Image:Torrentcomp small.gif|thumb|600px|alt=Animation showing 7 remote computers exchanging data with an 8th (local) computer over a network. |In this animation, the colored bars beneath all of the 7 clients in the upper region above represent the file, with each color representing an individual piece of the file. After the initial pieces transfer from the seed (large system at the bottom), the pieces are individually transferred from client to client. The original ''seeder'' only needs to send out one copy of the file for all the clients to receive a copy.]]
[[Image:BitTorrent network.png|thumb|600px|alt=The middle computer is acting as a seed to provide a file to the other computers which act as peers.. |The middle computer is acting as a seed to provide a file to the other computers which act as peers..]]


A BitTorrent client is any program that implements the BitTorrent protocol. Each client is capable of preparing, requesting, and transmitting any type of [[computer file]] over a network, using the protocol. A peer is any computer running an instance of a client.
A BitTorrent client is any program that implements the BitTorrent protocol. Each client is capable of preparing, requesting, and transmitting any type of [[computer file]] over a network, using the protocol. A peer is any computer running an instance of a client.

Revision as of 15:37, 29 March 2014

Présentation :


BitTorrent est un protocole réseau de partage de fichier en pair-à-pairs (P2P) créé par Bram Cohen en avril 2001, qui est utilisé pour la distribution de grande quantité de données. BitTorrent est l'un des protocoles le plus utilisé pour le téléchargement des fichiers de grande taille. En février 2009, des observatoires ont estimés que le protocole représentait entre 27 et 55% du trafic internet selon le lieu géographique.

Description

Le protocole BitTorrent permet de distribuer de large fichier sans alourdir la charge processeur de la source, ni la charge du réseau. Il repose sur une architecture réseau semi-centralisé. Pour tenter d'assurer de meilleure vitesse de transferts, BitTorrent se base sur une limitation du nombre d'utilisateurs vers lesquels sont envoyé les données tout en favorisant l'envoi de données aux utilisateurs depuis lesquels on en reçois le plus.

Plutôt que de télécharger depuis une source unique, l'utilisateur doit rejoindre un "essaim" d'hôtes, pour à son tour télécharger et envoyer simultanément vers les autres hôtes qui constituent cet essaim.

Le protocole BitTorrent est une méthode alternative à l'ancienne source unique, aux serveurs miroir de distribution de données et peut fonctionner sur des réseaux dont la bande passante est réduite, ainsi que sur de petits ordinateurs, tel que sur des téléphones mobile. Un utilisateur qui souhaite diffuser un fichier doit tout d'abord créer un petit fichier de description nommé "*.torrent", qu'il distribue ensuite de façon conventionnel (via internet, email, etc...).

L'utilisateur peut ainsi rendre le fichier disponible via un nœud (tracker) BitTorrent agissant comme une semence ou source (seed en anglais). Ainsi ceux qui possèdent le descripteur de fichier (*.torrent) peuvent distribuer via le tracker BitTorrent, en agissant en tant que peers (seeds)ou leechers.

Le fichier ainsi distribué est divisé en segments appelés morceaux. Chaque pair reçoit une nouvelle pièce du fichier, il devient une source de cette pièce pour d'autres pairs, soulageant ainsi la source d'avoir à envoyer un même morceau copie à chaque leecher. Avec BitTorrent, la tâche de distribuer le fichier est partagé entre tous ceux qui télécharge ce fichier.

Chaque morceaux du fichier est protégé par un hachage cryptographique contenu dans le descripteur de torrent (le fichier *.torrent). Cela permet d’empêcher toutes modifications mal intentionnées de la part d'une source qui souhaite diffuser un contenu corrompu du fichier partagé. Si une source commence avec une copie authentique du descripteur de torrent, il peut vérifier l'authenticité du fichier qu'il reçoit.


When a peer completely downloads a file, it becomes an additional seed. This eventual shift from peers to seeders determines the overall "health" of the file (as determined by the number of times a file is available in its complete form).

This distributed nature of BitTorrent leads to a flood like spreading of a file throughout peers. As more peers join the swarm, the likelihood of a successful download increases. Relative to standard Internet hosting, this provides a significant reduction in the original distributor's hardware and bandwidth resource costs. It also provides redundancy against system problems, reduces dependence on the original distributor<ref>Estimating Self-Sustainability in Peer-to-Peer Swarming Systems by D. Menasche, A. Rocha, E. de Souza e Silva, R. M. Leao, D. Towsley, A. Venkataramani</ref> and provides a source for the file which is generally temporary and therefore harder to trace than when provided by the enduring availability of a host in standard file distribution techniques.

Operation

The middle computer is acting as a seed to provide a file to the other computers which act as peers..
The middle computer is acting as a seed to provide a file to the other computers which act as peers..

A BitTorrent client is any program that implements the BitTorrent protocol. Each client is capable of preparing, requesting, and transmitting any type of computer file over a network, using the protocol. A peer is any computer running an instance of a client.

To share a file or group of files, a peer first creates a small file called a "torrent" (e.g. MyFile.torrent). This file contains metadata about the files to be shared and about the tracker, the computer that coordinates the file distribution. Peers that want to download the file must first obtain a torrent file for it and connect to the specified tracker, which tells them from which other peers to download the pieces of the file.

Though both ultimately transfer files over a network, a BitTorrent download differs from a classic download (as is typical with an HTTP or FTP request, for example) in several fundamental ways:

  • BitTorrent makes many small data requests over different TCP connections to different machines, while classic downloading is typically made via a single TCP connection to a single machine.
  • BitTorrent downloads in a random or in a "rarest-first"<ref name="Rarest First and Choke Algorithms Are Enough">Template:Cite web</ref> approach that ensures high availability, while classic downloads are sequential.

Taken together, these differences allow BitTorrent to achieve much lower cost to the content provider, much higher redundancy, and much greater resistance to abuse or to "flash crowds" than regular server software. However, this protection, theoretically, comes at a cost: downloads can take time to rise to full speed because it may take time for enough peer connections to be established, and it may take time for a node to receive sufficient data to become an effective uploader. This contrasts with regular downloads (such as from an HTTP server, for example) that, while more vulnerable to overload and abuse, rise to full speed very quickly and maintain this speed throughout.

In general, BitTorrent's non-contiguous download methods have prevented it from supporting "progressive downloads" or "streaming playback". However, comments made by Bram Cohen in January 2007 suggest that streaming torrent downloads will soon be commonplace and ad supported streaming appears to be the result of those comments. In January 2011 Cohen demonstrated an early version of BitTorrent streaming, saying the feature will be available by summer 2011.<ref name="Rarest First and Choke Algorithms Are Enough">Template:Cite web</ref>

Creating and publishing torrents

The peer distributing a data file treats the file as a number of identically sized pieces, usually with byte sizes of a power of 2, and typically between 32 kB and 16 MB each. The peer creates a hash for each piece, using the SHA-1 hash function, and records it in the torrent file. Pieces with sizes greater than 512 kB will reduce the size of a torrent file for a very large payload, but is claimed to reduce the efficiency of the protocol.<ref>Theory.org</ref> When another peer later receives a particular piece, the hash of the piece is compared to the recorded hash to test that the piece is error-free.<ref name = "Protocol1.0" /> Peers that provide a complete file are called seeders, and the peer providing the initial copy is called the initial seeder.

The exact information contained in the torrent file depends on the version of the BitTorrent protocol. By convention, the name of a torrent file has the suffix .torrent. Torrent files have an "announce" section, which specifies the URL of the tracker, and an "info" section, containing (suggested) names for the files, their lengths, the piece length used, and a SHA-1 hash code for each piece, all of which are used by clients to verify the integrity of the data they receive.

Torrent files are typically published on websites or elsewhere, and registered with at least one tracker. The tracker maintains lists of the clients currently participating in the torrent.<ref name="Protocol1.0">Template:Cite web</ref> Alternatively, in a trackerless system (decentralized tracking) every peer acts as a tracker. Azureus was the firstTemplate:Citation needed BitTorrent client to implement such a system through the distributed hash table (DHT) method. An alternative and incompatible DHT system, known as Mainline DHT, was later developed and adopted by the BitTorrent (Mainline), µTorrent, Transmission, rTorrent, KTorrent, BitComet, and Deluge clients.

After the DHT was adopted, a "private" flag — analogous to the broadcast flag — was unofficially introduced, telling clients to restrict the use of decentralized tracking regardless of the user's desires.<ref>Template:Cite web</ref> The flag is intentionally placed in the info section of the torrent so that it cannot be disabled or removed without changing the identity of the torrent. The purpose of the flag is to prevent torrents from being shared with clients that do not have access to the tracker. The flag was requested for inclusion in the official specification in August, 2008, but has not been accepted.<ref>Template:Cite web</ref> Clients that have ignored the private flag were banned by many trackers, discouraging the practice.<ref>Template:Cite web</ref>

Downloading torrents and sharing files

Users browse the web to find a torrent of interest, download it, and open it with a BitTorrent client. The client connects to the tracker(s) specified in the torrent file, from which it receives a list of peers currently transferring pieces of the file(s) specified in the torrent. The client connects to those peers to obtain the various pieces. If the swarm contains only the initial seeder, the client connects directly to it and begins to request pieces.

Clients incorporate mechanisms to optimize their download and upload rates; for example they download pieces in a random order to increase the opportunity to exchange data, which is only possible if two peers have different pieces of the file.

The effectiveness of this data exchange depends largely on the policies that clients use to determine to whom to send data. Clients may prefer to send data to peers that send data back to them (a tit for tat scheme), which encourages fair trading. But strict policies often result in suboptimal situations, such as when newly joined peers are unable to receive any data because they don't have any pieces yet to trade themselves or when two peers with a good connection between them do not exchange data simply because neither of them takes the initiative. To counter these effects, the official BitTorrent client program uses a mechanism called "optimistic unchoking", whereby the client reserves a portion of its available bandwidth for sending pieces to random peers (not necessarily known good partners, so called preferred peers) in hopes of discovering even better partners and to ensure that newcomers get a chance to join the swarm.<ref name = "Tamilmanistudy" >Template:Cite web</ref>

Although swarming scales well to tolerate flash crowds for popular content, it is less useful for unpopular content. Peers arriving after the initial rush might find the content unavailable and need to wait for the arrival of a seed in order to complete their downloads. The seed arrival, in turn, may take long to happen (this is termed the seeder promotion problem). Since maintaining seeds for unpopular content entails high bandwidth and administrative costs, this runs counter to the goals of publishers that value BitTorrent as a cheap alternative to a client-server approach. This occurs on a huge scale; measurements have shown that 38% of all new torrents become unavailable within the first month.<ref>Unraveling BitTorrent's File Unavailability:Measurements and Analysis by Sebastian Kaune, Ruben Cuevas Rumin, Gareth Tyson, Andreas Mauthe, Ralf Steinmetz</ref> A strategy adopted by many publishers which significantly increases availability of unpopular content consists of bundling multiple files in a single swarm.<ref>Content Availability and Bundling in Swarming Systems by D. Menasche, A. Rocha, B. Li, D. Towsley, A. Venkataramani</ref> More sophisticated solutions have also been proposed; generally, these use cross-torrent mechanisms through which multiple torrents can cooperate to better share content.<ref>The Seeder Promotion Problem: Measurements, Analysis and Solution Space by Sebastian Kaune, Gareth Tyson, Konstantin Pussep, Andreas Mauthe, Aleksandra Kovacevic and Ralf Steinmetz</ref>

BitTorrent does not offer its users anonymity. It is possible to obtain the IP addresses of all current and possibly previous participants in a swarm from the tracker. This may expose users with insecure systems to attacks.<ref name = "Tamilmanistudy" /> It may also expose users to the risk of being sued, if they are distributing files without permission from the copyright holder(s). However, there are ways to promote anonymity; for example, the OneSwarm project layers privacy-preserving sharing mechanisms on top of the original BitTorrent protocol.

Adoption

A growing number of individuals and organizations are using BitTorrent to distribute their own or licensed material. Independent adopters report that without using BitTorrent technology and its dramatically reduced demands on their private networking hardware and bandwidth, they could not afford to distribute their files.<ref>See, for example, Why Bit Torrent at http://tasvideos.org tasvideos.org.</ref>

Film, video and music

  • BitTorrent Inc. has amassed a number of licenses from Hollywood studios for distributing popular content from their websites.
  • Sub Pop Records releases tracks and videos via BitTorrent Inc.<ref>Template:Cite web</ref> to distribute its 1000+ albums. Babyshambles and The Libertines (both bands associated with Pete Doherty) have extensively used torrents to distribute hundreds of demos and live videos. US industrial rock band Nine Inch Nails frequently distributes albums via BitTorrent.
  • Podcasting software is starting to integrate BitTorrent to help podcasters deal with the download demands of their MP3 "radio" programs. Specifically, Juice and Miro (formerly known as Democracy Player) support automatic processing of .torrent files from RSS feeds. Similarly, some BitTorrent clients, such as µTorrent, are able to process web feeds and automatically download content found within them.
  • DGM Live! purchases are provided via BitTorrent.<ref>DGMlive.com</ref>

Broadcasters

  • In 2008, the CBC became the first public broadcaster in North America to make a full show (Canada's Next Great Prime Minister) available for download using BitTorrent.<ref>Template:Cite news</ref>
  • The Norwegian Broadcasting Corporation (NRK) has since March 2008 experimented with bittorrent distribution, available online.<ref>Bittorrent</ref> Only selected material in which NRK owns all royalties are published. Responses have been very positive, and NRK is planning to offer more content.
  • The Dutch VPRO broadcasting organization released three documentaries under a Creative Commons license using the content distribution feature of the Mininova tracker.

Personal material

  • The Amazon S3 "Simple Storage Service" is a scalable Internet-based storage service with a simple web service interface, equipped with built-in BitTorrent support.
  • Blog Torrent offers a simplified BitTorrent tracker to enable bloggers and non-technical users to host a tracker on their site. Blog Torrent also allows visitors to download a "stub" loader, which acts as a BitTorrent client to download the desired file, allowing users without BitTorrent software to use the protocol.<ref>Template:Cite web</ref> This is similar to the concept of a self-extracting archive.

Software

  • Blizzard Entertainment uses BitTorrent (via a proprietary client called the "Blizzard Downloader") to distribute most content for StarCraft II and World of Warcraft, including the games themselves.<ref>Template:Cite web</ref>
  • Many software games, especially those whose large size makes them difficult to host due to bandwidth limits, extremely frequent downloads, and unpredictable changes in network traffic, will distribute instead a specialized, stripped down bittorrent client with enough functionality to download the game from the other running clients and the primary server (which is maintained in case not enough peers are available).
  • Many major open source and free software projects encourage BitTorrent as well as conventional downloads of their products (via HTTP, FTP etc.) to increase availability and to reduce load on their own servers, especially when dealing with larger files.<ref>Template:Cite web</ref>

Government

Others

Network impact

Template:Update CableLabs, the research organization of the North American cable industry, estimates that BitTorrent represents 18% of all broadband traffic.<ref name="Ellis">Template:Cite web</ref>Template:Update after In 2004, CacheLogic put that number at roughly 35% of all traffic on the Internet.<ref name="Pasick">Template:Cite web</ref>Template:Update after The discrepancies in these numbers are caused by differences in the method used to measure P2P traffic on the Internet.<ref name="Sevcik">Template:Cite web</ref>

Routers that use network address translation (NAT) must maintain tables of source and destination IP addresses and ports. Typical home routers are limited to about 2000 table entries while some more expensive routers have larger table capacities. BitTorrent frequently contacts 300–500 servers per second rapidly filling the NAT tables. This is a common cause of home routers locking up.<ref>Template:Cite web</ref>

Indexing

The BitTorrent protocol provides no way to index torrent files. As a result, a comparatively small number of websites have hosted a large majority of torrents, many linking to copyrighted material without the authorization of copyright holders, rendering those sites especially vulnerable to lawsuits.<ref>Template:Cite web</ref> Several types of websites support the discovery and distribution of data on the BitTorrent network.

Public torrent hosting sites such as The Pirate Bay allow users to search and download from their collection of torrent files. Users can typically also upload torrent files for content they wish to distribute. Often, these sites also run BitTorrent trackers for their hosted torrent files, but these two functions are not mutually dependent: a torrent file could be hosted on one site and tracked by another, unrelated site.

Private host/tracker sites operate like public ones except that they restrict access to registered users and keep track of the amount of data each user uploads and downloads, in an attempt to reduce leeching.

Search engines allow the discovery of torrent files that are hosted and tracked on other sites; examples include Mininova, BTJunkie, Torrentz, The Pirate Bay, Eztorrent and isoHunt. These sites allow the user to ask for content meeting specific criteria (such as containing a given word or phrase) and retrieve a list of links to torrent files matching those criteria. This list can often be sorted with respect to several criteria, relevance (seeders-leechers ratio) being one of the most popular and useful (due to the way the protocol behaves, the download bandwidth achievable is very sensitive to this value). Bram Cohen launched a BitTorrent search engine on http://www.bittorrent.com/search that co-mingles licensed content with search results.<ref>Template:Cite web</ref> Metasearch engines allow one to search several BitTorrent indices and search engines at once.

Vocabulaire

Availability / disponibilité : (aussi appelé « distributed copies ») La quantité de copies complètes du fichier disponibles. Un peer qui se connecte avec une fraction de fichier ajoute cette fraction à la disponibilité (si aucun autre ne possède déjà cette partie du fichier). Chaque seed ajoute 1,0 à ce nombre, étant donné qu'il possède la totalité du fichier. Ainsi, un peer avec 65,3 % d'un fichier téléchargé augmente la disponibilité de 0,653. Si deux peers ont téléchargé chacun la même moitié d'un fichier et qu'ils sont rejoints par un seeder (semeur), la disponibilité s'élève à 1,5.

Leecher : Une « sangsue » (leech en anglais) est un client souhaitant télécharger le fichier, mais ne le possédant pas en totalité. Ce terme a également un sens péjoratif pour qualifier ceux qui téléchargent sans envoyer en contrepartie de fichier sur le réseau. Lors de la première apparition d'un torrent, il n'y a que peu de sources (seeds) et beaucoup de sangsues (leech).

Peers : Clients pouvant transmettre une partie de l'archive. (Il ne possède pas l'intégralité de celle-ci).

Scrape : Le client Bittorrent (voir clients utilisant le réseau BitTorrent) débute par un scrape, ce qui consiste à demander au tracker des informations sur le nombre total de sources (seeds) et de sangsues (leech). Ce nombre sera généralement affiché entre parenthèses.

Seed : Une semence (seed en anglais) est un client partageant un fichier complet.

Share Ratio : Proportion entre le nombre de données envoyées, partagées (upload) et le nombre de données reçues (download). Dans un monde utopique, tout un chacun utilisant ce service devrait avoir un ratio de 1:1, ce qui indique qu'il y a eu autant de données envoyées que reçues. De cette façon les téléchargements sont rapides et efficaces pour tous.

Superseed : Seed qui tente de minimiser la quantité de données qu'il envoie jusqu'à ce qu'un leecher obtienne la totalité des fichiers du torrent et devienne ainsi un seeder supplémentaire. Cette fonction ne devrait être utilisée que lorsqu'il n'y a qu'un seul seeder, ou très peu d'entre eux, disponibles pour un torrent, et que ces seeders disposent de capacités d'upload relativement limitées. L'opération consiste à ne déclarer posséder aucune partie de l'archive. Lorsque les clients se connectent, le super-seeder les informe qu'il dispose d'une nouvelle partie que personne ne possède. Il transmet la partie en question et ne transmettra aucune nouvelle partie de l'archive tant qu'il n'aura pas reçu confirmation que la pièce a été transmise à nouveau. De cette manière, les clients en mode superseed n'envoient que les morceaux qui n'ont encore jamais été envoyés, et s'assurent de leur bonne diffusion par, et parmi, les leechers.

Swarm : Tous les peers (y compris les seeds) qui partagent un même torrent sont appelés un essaim. Par exemple, six peers et deux seeds forment un essaim de huit unités.

Torrent : Selon le contexte, un torrent peut signifier soit un fichier de métadonnées .torrent, soit tout fichier décrit par lui. Le fichier torrent contient des métadonnées relatives aux fichiers qu'il rend téléchargeables, y compris leurs noms, tailles et sommes de contrôle de toutes pièces du torrent. Il contient également l'adresse IP d'un tracker qui coordonne la communication entre les peers du swarm (essaim).

Tracker : Un traqueur est un serveur qui sait en permanence quels seeds et quels peers se trouvent dans le swarm (l'essaim). Les clients lui envoient régulièrement des informations. En échange, ils reçoivent des informations sur d'autres clients auxquels ils peuvent se connecter. Le traqueur n'est pas directement impliqué dans le transfert de données et ne possède pas de copie du fichier. Les quelques secondes par heure où un client se connecte au tracker sont en principe les seules où il puisse se faire identifier (mais par quelques dizaines d'autres clients seulement).

Implementations

Template:Main

The BitTorrent specification is free to use and many clients are open source, so BitTorrent clients have been created for all common operating systems using a variety of programming languages. The official BitTorrent client, µTorrent, Vuze, Transmission, and BitComet are some of the most popular clients. Template:Citation needed

Some BitTorrent implementations such as MLDonkey and Torrentflux are designed to run as servers. For example, this can be used to centralize file sharing on a single dedicated server which users share access to on the network.<ref>Torrent Server combines a file server with P2P file sharing</ref> Server-oriented BitTorrent implementations can also be hosted by hosting providers at co-located facilities with high bandwidth Internet connectivity (e.g., a datacenter) which can provide dramatic speed benefits over using BitTorrent from a regular home broadband connection.

Services such as ImageShack can download files on BitTorrent for the user, allowing them to download the entire file by HTTP once it is finished.

The Opera web browser supports BitTorrent,<ref>Template:Cite web</ref> as does Wyzo. BitLet allows users to download Torrents directly from their browser using a Java applet. Sites such as xFiles and DuShare allow to transfer big files directly using bittorrent inside adobe Flash.

An increasing number of hardware devices are being made to support BitTorrent. These include routers and NAS devices containing BitTorrent-capable firmware like OpenWrt.

Proprietary versions of the protocol which implement DRM, encryption, and authentication are found within managed clients such as Pando.

Development

An unimplemented (Template:As of) unofficial feature is Similarity Enhanced Transfer (SET), a technique for improving the speed at which peer-to-peer file sharing and content distribution systems can share data. SET, proposed by researchers Pucha, Andersen, and Kaminsky, works by spotting chunks of identical data in files that are an exact or near match to the one needed and transferring these data to the client if the "exact" data are not present. Their experiments suggested that SET will help greatly with less popular files, but not as much for popular data, where many peers are already downloading it.<ref>Template:Cite web</ref> Andersen believes that this technique could be immediately used by developers with the BitTorrent file sharing system.<ref>Template:Cite news</ref>

Template:As of, BitTorrent, Inc. is working with Oversi on new Policy Discover Protocols that query the ISP for capabilities and network architecture information. Oversi's ISP hosted NetEnhancer box is designed to "improve peer selection" by helping peers find local nodes, improving download speeds while reducing the loads into and out of the ISP's network.<ref>Arstechnica.com</ref>

Legal issues

Template:Main There has been much controversy over the use of BitTorrent trackers. BitTorrent metafiles themselves do not store file contents. Whether the publishers of BitTorrent metafiles violate copyrights by linking to copyrighted material without the authorization of copyright holders is controversial.

Various jurisdictions have pursued legal action against websites that host BitTorrent trackers. High-profile examples include the closing of Suprnova.org, Torrentspy, LokiTorrent, Mininova and OiNK.cd. The Pirate Bay torrent website, formed by a Swedish group, is noted for the "legal" section of its website in which letters and replies on the subject of alleged copyright infringements are publicly displayed. On 31 May 2006, The Pirate Bay's servers in Sweden were raided by Swedish police on allegations by the MPAA of copyright infringement;<ref>Template:Cite web</ref> however, the tracker was up and running again three days later.

BitTorrent and malware

Several studies on BitTorrent have indicated that a large portion of files available for download via BitTorrent contain malware. In particular, one small sample<ref>Template:Cite web</ref> indicated that 18% of all executable programs available for download contained malware. Another study <ref>Template:Citation</ref> claims that as much as 14.5% of BitTorrent downloads contain zero-day malware, and that BitTorrent was used as the distribution mechanism for 47% of all zero-day malware they have found.

See also

Template:Portal

References