List of distributed computing projects

From Wikipedia, the free encyclopedia

Jump to: navigation, search

A list of distributed computing projects.

Contents

[edit] The fastest Distributed Computers

The PlayStation 3 Folding@home client displays a 3D model of the protein being simulated
  • Folding@Home is of May 2009 sustaining over 4.3 PFLOPS [1], the first computing project of any kind to cross the four petaFLOPS milestone. This level of performance is primarily enabled by the cumulative effort of a vast array of PlayStation 3 and powerful GPU units.[2]
  • The entire BOINC averages over 1.5 PFLOPS as of March 15, 2009[3].
  • SETI@Home computes data averages more than 528 TFLOPS[4]
  • Einstein@Home is crunching more than 150 TFLOPS[5]
  • As of August 2008, GIMPS is sustaining 27 TFLOPS.[6]

Intel Corporation has recently unveiled the experimental multi-core POLARIS chip, which achieves 1 TFLOPS at 3.13 GHz. The 80-core chip can increase this result to 2 TFLOPS at 6.26 GHz, although the thermal dissipation at this frequency exceeds 190 watts[7].

As of 2008, the fastest PC processors (quad-core) perform over 70 GFLOPS (Intel Core i7 965 XE) in double precision[8]. GPUs are considerably more powerful, for example, in the GeForce 200 Series the nVidia GTX 280 performs around 933 GFLOPS on 240 processing elements in single precision calculations[9]. However, while GPUs are highly efficient at single precision calculations they are not as flexible as a general purpose CPUs in double precision operations.

[edit] Cycle Scavenging Infrastructure

[edit] Berkeley Open Infrastructure for Network Computing (BOINC)

Berkeley Open Infrastructure for Network Computing (BOINC), originally developed to support SETI@home, became useful as a platform for several distributed applications in areas as diverse as mathematics, medicine, molecular biology, climatology, and astrophysics.[10]

The Berkeley Open Infrastructure for Network Computing (BOINC) is currently the most popular volunteer-based distributed computing platform as of April 2009.[11][12]

[edit] Active Projects

[edit] Upcoming Projects

These projects are considered to be in the Alpha or Beta development stages.

  • Art
    • BURP — to develop a publicly distributed system for rendering 3D animations. (Alpha)

Performance of BOINC projects[41]:

  • over 1,650,000 participants (over 290,000 active)
  • over 3,950,000 computers (over 500,000 active)
  • over 1.5 PetaFLOPS (more than supercomputer Blue Gene)

[edit] Distributed.net

Distributed.net runs several projects:

[edit] World Community Grid

The World Community Grid is an IBM philanthropic initiative which aims to create the largest public computing grid benefiting humanity. It utilizes the BOINC platform.

[edit] Active Projects

  • Human Proteome Folding Project - Phase 2 — predicting functions of proteins in conjunction with rosetta@home.
  • FightAIDS@Home — identify candidate drugs that have the right shape and chemical characteristics to block HIV protease.
  • Discovering Dengue Drugs – Together — uncover novel drugs to cure dengue hemorrhagic fever, hepatitis C, West Nile encephalitis, and Yellow fever.
  • Help Conquer Cancer — improve the results of protein X-ray crystallography in order to increase understanding of cancer and its treatment.
  • Nutritious Rice for the World — Predict the protein structures of rice in order to help rice breeders create more abundant, resilient and nutritious harvests.
  • The Clean Energy Project[1][2] Trying to find the best organic substances for the next generation's solar cells and storing energy
  • Help Fight Childhood Cancer[3] The mission of the Help Fight Childhood Cancer project is to find drugs that can disable three particular proteins associated with neuroblastoma, one of the most frequently occurring solid tumors in children. Identifying these drugs could potentially make the disease much more curable when combined with chemotherapy treatment.
  • Influenza Antiviral Drug Search[4] The mission of the Influenza Antiviral Drug Search project is to find new drugs that can stop the spread of an influenza infection in the body. The research will specifically address the influenza strains that have become drug resistant as well as new strains that are appearing. Identifying the chemical compounds that are the best candidates will accelerate the efforts to develop treatments that would be useful in managing seasonal influenza outbreaks, and future influenza epidemics and even pandemics.
  • Help Cure Muscular Dystrophy - Phase 2 — [5] World Community Grid and researchers supported by Decrypthon, a partnership between AFM (French Muscular Dystrophy Association), CNRS (French National Center for Scientific Research), Universite Pierre et Marie Curie, and IBM are investigating protein-protein interactions for more than 2,200 proteins whose structures are known, with particular focus on those proteins that play a role in neuromuscular diseases. The database of information produced will help researchers design molecules to inhibit or enhance binding of particular macromolecules, hopefully leading to better treatments for muscular dystrophy and other neuromuscular diseases.

[edit] Completed Projects

[edit] Commercial Cycle Scavengers

[edit] Gomez Peer

Commercial distributed computing project that uses peers' computers to measure the real-world performance of websites. They pay the active peers 0.10$ for being online more than 20 hours a day and 0.0005$ for each processing minute. Each peer can install their program on more than one computer and sum up online and processing time. Payments are distributed by PayPal (you must have a minimum of $5 to get a payment).

[edit] Parabon Computation

The Parabon Computation client uses a Java VM technology, and is commercial in nature.

[edit] Projects funded by the European Union

  • neuGRID is a European Union-funded project within the frame of the Information&Communication and Technologies Work Frame. The aim of the project is to develop a new user-friendly Grid-based research e-Infrastructure enabling the European neuroscience community to carry out research required for the pressing study of degenerative brain diseases (for example, the Alzheimer disease).
  • D4Science is a European e-Infrastructure project co-funded by the European Commission's Seventh Framework Programme for Research and Technological Development. The project aims to continue the path that the GÉANT, EGEE, and DILIGENT projects have initiated towards establishing networking, grid-based, and data-centric e-Infrastructures that accelerate multidisciplinary research by overcoming barriers related to heterogeneity, sustainability and scalability.

[edit] Custom/Uncategorized Platforms

Custom software encompasses distributed computing projects that do not make use of a third-party generic client-server infrastructure or which use one other than those listed above.

Note that not all of these projects use cycle-scavenging technology; some instead focus on more traditional remote-access-to-HPC approaches.

[edit] Active projects

  • Mathematics
  • Internet
    • AssessGrid [8] Addresses obstacles to a wide adoption of Grid technologies by bringing risk management and assessment to this field, enabling use of Grid computing in business and society.
    • A-Ware [9] will develop a stable, supported, commercially exploitable, high quality technology to give easy access to Grid resources.
    • Cohesion Platform [10] is a Java-based modular Peer-to-Peer multi-application Desktop Grid computing platform for irregularly structured problems developed at the University of Tübingen (Germany).
    • DIMES — is a distributed computing project which maps the structure and evolution of the Internet infrastructure, allowing users to see how the Internet looks from their home.
    • Enabling Grids for E-sciencE
    • GridCOMP [11] — provides an advanced component platform for an effective invisible Grid.
    • GridECON [12] takes a user-oriented perspective and creates solutions to grid challenges to promote the widespread use of grids.
    • Hours — Ongoing project HarmOny and Useful Resource Sharing. Attempts to make use of the trust management and network economics to implement the heterogeneous resource sharing. Currently focusing on the resource allocation in the science grid like Teragrid and OSG. This project is run by the MIST group of Computer Science at Wayne State University.[13]
    • JHDC — Open source programmable Java distributed computing system.
    • Legion — Grid computing platform developed at the University of Virginia.
    • Majestic-12 — Uses a distributed web crawler program to index web sites for a distributed search engine.[14]
    • NESSI-GRID [15] aims to provide a unified view for European research in Services Architectures and Software Infrastructures that will define technologies, strategies and deployment policies fostering new, open, industrial solutions and societal applications that enhance the safety, security and well-being of citizens.
    • OMII-Europe is an EU-funded project which has been established to source key software components that can interoperate across several heterogeneous Grid middleware platforms.
    • OMII-UK provides free Open Source software and support to enable a sustained future for the UK e-Research community.
    • OurGrid — aims to deliver grid technology that can be used today by current users to solve present problems. To achieve this goal, OurGrid chooses a different trade-off compared to most grid projects. It forfeits supporting arbitrary applications in favor of supporting only Bag-of-Tasks applications.
    • ScottNet NCG — This is a distributed neural computing grid. A private commercial effort in continuous operation since 1995. This system performs a series of functions including data synchronization amongst databases, mainframe systems, and other data repositories. E-Commerce transaction processing, automated research and data retrieval, content analysis, web site monitoring, scripted and dynamic user emulation, shipping and fulfillment API integration and management, RSS and NNTP monitoring and analysis, real time security enforcement, and backup / restore functionality.[16]
  • Biology
  • Chemistry
  • Earth Sciences
    • Climateprediction.net — seeks to forecast the climate of the Earth in the 21st century. The original windows client is in process of being retired. At this time the windows client is used for Open University classes only.
  • Art
  • Physics
  • Engineering
  • Cryptography
  • Miscellaneous

[edit] Completed projects

[edit] Abandoned projects

These projects were either abandoned outright or in some cases merged with other larger ongoing projects.

[edit] Upcoming projects

  • Storage@home — distributed storage infrastructure developed to solve the problem of backing up and sharing petabytes of scientific results using a distributed model of volunteer managed hosts. Data is maintained by a mixture of replication and monitoring, with repairs done as needed.

[edit] Volunteer distributed computing projects

Popular projects in volunteer distributed computing include:[51][not in citation given]

Project Start Affiliation Area Peak_#hosts Current status Computing power
GIMPS 1996 ? mathematics 10,000 active 27 TFLOPS
distributed.net 1997 U.S. non-profit organization cryptography 100,000 active ?
SETI@home 1999 University of California, Berkeley SETI 362,000 active 528 TFLOPS
Electric Sheep 1999 ? art 57,000 active ?
Folding@home 2000 Stanford University biology 406,000 active 8.1 PFLOPS
BOINC 2002 University of California, Berkeley biomedicine, other 550,000 active 1.5 PFLOPS
Grid.org 2002 philanthropic by United Devices biomedicine, other 3,734,000[52] closed ?
Climateprediction.net 2003 University of Oxford climate change 150,000 active ?
LHC@home 2004 CERN physics 60,000 active ?
World Community Grid 2004 philanthropic by IBM biomedicine, other 700,000[52] active ?
Einstein@home 2005 LIGO astrophysics 200,000 active 150 TFLOPS
Rosetta@home 2005 University of Washington biology 100,000 active ?

[edit] Physical infrastructure projects

These projects attempt to make large physical computation infrastructures available for researchers to use:

[edit] Other distributed computing software platforms

The following are generic software platforms or infrastructures used to implement some of the projects listed in the previous section.

  • Alchemi — A .NET-based system for building enterprise Grids and applications.
  • Amoeba — distributed operating system that is designed for distributed computing tasks.
  • Base One Foundation Component LibraryRAD framework for database-centric distributed computing.
  • Beowulf clustersLinux based parallel computing using commodity hardware.
  • Condor — a flexible high-throughput distributed computing scheduler
  • DCEZ — Simple to set up and use platform to perform distributed computing with a minimum of infrastructure.
  • Distributed objects — systems like CORBA, Microsoft DCOM, Java RMI, and others that try to map object oriented design onto the network.
  • DragonFly BSD — an operating system aiming to support SSI clustering
  • Globus Toolkit — an open source software toolkit used for building Grid systems and applications
  • GreenTea Software — a Java-based P2P generic distributed network computing platform that transmits code and data on-demand to run on heterogeneous OS's.
  • Gridbus Toolkit — an open source software toolkit used for building market-oriented Grid systems and applications
  • Grid MP — an infrastructure created by United Devices, used to run grid.org, and is one of the infrastructures used by World Community Grid.
  • JPPF — an open source computational grid toolkit focused on performance and ease of use
  • JSTM — uses a java Software Transactional Memory implementation for distributed object replication.
  • Popular Power — (Defunct) building a platform for Internet-wide distributed computing.
  • ProActive ProActive is a Java middleware (part of the ObjectWeb consortium, with Open Source code) for parallel, distributed and multi-threaded computing.
  • RPyC — Remote Python Call, a platform for building distributed applications.
  • Sun Grid Engine — a distributed resource management system, similar to Condor
  • SynfiniWayFujitsu's middleware with which a virtualised IT framework can be created that provides a uniform and global view of resources within a department, a company, or a company with its suppliers.
  • Terracotta — Open source Java clustering - Java extension that provides seamless clustering of plain java code. See also Terracotta Cluster.
  • UNICORE — an open source software platform for supporting Grid systems and applications
  • Vaakya — software developed by Vaakya Technologies Pvt. Ltd., a Bangalore-based company. It has its own language and different frameworks (e.g. business application components, handheld devices, 3D graphics) that allow ISVs to develop applications, particularly for businesses, that run entirely on premises on ordinary work stations, not expensive servers.
  • XGE — a Windows-based product which distributes tasks on a local network by virtualizing filesystem access.
  • Xgrid — software developed by Apple's Advanced Computation Group.

Comparison of cluster software

[edit] References

  1. ^ "Client statistics by OS". Folding@Home. 2009-01-22. http://fah-web.stanford.edu/cgi-bin/main.py?qtype=osstats. Retrieved on 2009-01-23. 
  2. ^ Staff (November 6, 2008). "Sony Computer Entertainment's Support for Folding@home™ Project on PlayStation®3 Receives This Year's "Good Design Gold Award"". Sony Computer Entertainment Inc.. Sony Computer Entertainment Inc. (Sony Computer Entertainment Inc.). http://www.scei.co.jp/corporate/release/081106de.html. Retrieved on December 11, 2008. 
  3. ^ "Credit overview". BOINC. http://www.boincstats.com/stats/project_graph.php?pr=bo. Retrieved on 2008-08-04. 
  4. ^ "SETI@Home Credit overview". BOINC. http://www.boincstats.com/stats/project_graph.php?pr=sah. Retrieved on 2008-08-04. 
  5. ^ "Server Status". Einstein@Home. http://einstein.phys.uwm.edu/server_status.php. Retrieved on 2008-07-08. 
  6. ^ Internet PrimeNet Server Parallel Technology for the Great Internet Mersenne Prime Search
  7. ^ http://www.bit-tech.net/hardware/2007/04/30/the_arrival_of_teraflop_computing/2
  8. ^ "Intel Core i7 Performance Preview". TECHGAGE. 2008-11-03. http://techgage.com/article/intel_core_i7_performance_preview/9. Retrieved on 2008-11-17. 
  9. ^ http://www.tomshardware.com/reviews/nvidia-gtx-280,1953-2.html
  10. ^ BOINC - Berkeley Open Infrastructure for Network Computing, Dr. David Anderson describes SETI@home, BOINC and Distributed Computing, youtube.com
  11. ^ BOINCStats, active hosts.
  12. ^ Folding@Home, active hosts.
  13. ^ "POEM@Home website
  14. ^ "BRaTS@Home" website
  15. ^ "Milkyway@home" website
  16. ^ "Distributed Exact Cover Solver" website
  17. ^ Ramsey@Home website
  18. ^ Rectilinear Crossing Number website
  19. ^ "NQueens@Home" website
  20. ^ "pPot Tables" website
  21. ^ "Project Sudoku" website
  22. ^ FreeHAL@home website
  23. ^ "Cels@Home" website
  24. ^ "Docking@Home" website
  25. ^ "DrugDiscovery@Home" website
  26. ^ "Genetic Life" website
  27. ^ "GPUGRID.net" website
  28. ^ Hydrogen@Home website
  29. ^ "SciLINC" website
  30. ^ Botanicus.org description of SciLINC
  31. ^ "Superlink@Technion" website
  32. ^ "Virtual Prairie" website
  33. ^ "AQUA@home" website
  34. ^ "Leiden Classical" website
  35. ^ LHC@home website
  36. ^ "Magnetism@home" website
  37. ^ "RND@home" website
  38. ^ "APS@Home" website
  39. ^ "Second Computing" website"
  40. ^ "Yoyo@home" website
  41. ^ BOINCstats | BOINC combined - Credit overview
  42. ^ "www.Bio4All.Tk" website
  43. ^ "sharkGrid" website
  44. ^ Xiaoyun Wang, Dengguo Feng, Xuejia Lai, Hongbo Yu: Collisions for Hash Functions MD4, MD5, HAVAL-128 and RIPEMD, Cryptology ePrint Archive Report 2004/199, 16 Aug 2004, revised 17 Aug 2004.
  45. ^ "Eternity2.net" website
  46. ^ "DepSpid" website
  47. ^ "Project Neuron" website
  48. ^ "RenderFarm@Home" website
  49. ^ "SHA-1 Collision Search" website
  50. ^ 3x+1@home website
  51. ^ David P. Anderson (2005-05-23). A Million Years of Computing. http://boinc.berkeley.edu/talks/singapore_public.pdf. Retrieved on 2009-01-30. 
  52. ^ a b Host numbers from the UD platform represent unique installations, so are greater than the number of actual computers.

[edit] External links

Personal tools