Matches in DBpedia 2015-04 for { <http://dbpedia.org/resource/Apache_Spark> ?p ?o }
Showing triples 1 to 39 of 39 with 100 triples per page.
- Apache_Spark abstract "Apache Spark is an open-source cluster computing framework originally developed in the AMPLab at UC Berkeley. In contrast to Hadoop's two-stage disk-based MapReduce paradigm, Spark's in-memory primitives provide performance up to 100 times faster for certain applications. By allowing user programs to load data into a cluster's memory and query it repeatedly, Spark is well suited to machine learning algorithms. Spark requires a cluster manager and a distributed storage system. For cluster manager, Spark supports standalone (native Spark cluster), Hadoop YARN, or Apache Mesos. For distributed storage, Spark can interface with a wide variety, including Hadoop Distributed File System (HDFS), Cassandra, OpenStack Swift, and Amazon S3. Spark also supports a pseudo-distributed mode, usually used only for development or testing purposes, where distributed storage is not required and the local file system can be used instead; in the scenario, Spark is running on a single machine with one worker per CPU core. Spark has over 465 contributors in 2014, making it the most active project in the Apache Software Foundation and among Big Data open source projects.".
- Apache_Spark thumbnail Spark-logo-192x100px.png?width=300.
- Apache_Spark wikiPageExternalLink spark.apache.org.
- Apache_Spark wikiPageExternalLink spark.apache.org.
- Apache_Spark wikiPageExternalLink graphx.
- Apache_Spark wikiPageExternalLink mllib.
- Apache_Spark wikiPageExternalLink sql.
- Apache_Spark wikiPageExternalLink streaming.
- Apache_Spark wikiPageID "42164234".
- Apache_Spark wikiPageRevisionID "644691228".
- Apache_Spark developer "Apache Software Foundation, UC Berkeley AMPLab, Databricks".
- Apache_Spark genre "data analytics, machine learning algorithms".
- Apache_Spark latestReleaseVersion "v1.2.0".
- Apache_Spark license "Apache License 2.0".
- Apache_Spark logo File:Spark-logo-192x100px.png.
- Apache_Spark name "Apache Spark".
- Apache_Spark operatingSystem Linux.
- Apache_Spark operatingSystem Mac_OS.
- Apache_Spark operatingSystem Microsoft_Windows.
- Apache_Spark programmingLanguage Java_(programming_language).
- Apache_Spark programmingLanguage Python_(programming_language).
- Apache_Spark programmingLanguage Scala_(programming_language).
- Apache_Spark status "Active".
- Apache_Spark website spark.apache.org.
- Apache_Spark subject Category:Apache_Software_Foundation.
- Apache_Spark subject Category:Cluster_computing.
- Apache_Spark subject Category:Data_mining_and_machine_learning_software.
- Apache_Spark subject Category:Hadoop.
- Apache_Spark subject Category:Software_using_the_Apache_license.
- Apache_Spark subject Category:University_of_California,_Berkeley.
- Apache_Spark comment "Apache Spark is an open-source cluster computing framework originally developed in the AMPLab at UC Berkeley. In contrast to Hadoop's two-stage disk-based MapReduce paradigm, Spark's in-memory primitives provide performance up to 100 times faster for certain applications. By allowing user programs to load data into a cluster's memory and query it repeatedly, Spark is well suited to machine learning algorithms. Spark requires a cluster manager and a distributed storage system.".
- Apache_Spark label "Apache Spark".
- Apache_Spark sameAs m.0ndhxqz.
- Apache_Spark sameAs Q7573619.
- Apache_Spark sameAs Q7573619.
- Apache_Spark wasDerivedFrom Apache_Spark?oldid=644691228.
- Apache_Spark depiction Spark-logo-192x100px.png.
- Apache_Spark homepage spark.apache.org.
- Apache_Spark isPrimaryTopicOf Apache_Spark.
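
The listing above is the result of matching the triple pattern { <http://dbpedia.org/resource/Apache_Spark> ?p ?o } and returning the matches one page at a time. A minimal sketch of that idea follows, assuming an in-memory list of triples with shortened names instead of full IRIs. The helper names `match` and `page` are hypothetical, chosen for illustration, and are not part of any DBpedia interface.

```python
from typing import Iterable, Iterator, List, Optional, Tuple

Triple = Tuple[str, str, str]

# A handful of triples copied from the listing (short names, not full IRIs).
TRIPLES: List[Triple] = [
    ("Apache_Spark", "license", "Apache License 2.0"),
    ("Apache_Spark", "latestReleaseVersion", "v1.2.0"),
    ("Apache_Spark", "programmingLanguage", "Java_(programming_language)"),
    ("Apache_Spark", "programmingLanguage", "Python_(programming_language)"),
    ("Apache_Spark", "programmingLanguage", "Scala_(programming_language)"),
]


def match(
    triples: Iterable[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> Iterator[Triple]:
    """Yield triples that fit the pattern; None plays the role of a variable."""
    for triple in triples:
        if all(want is None or want == got for want, got in zip((s, p, o), triple)):
            yield triple


def page(items: List[Triple], number: int, size: int = 100) -> List[Triple]:
    """Return the 1-based page `number`, with at most `size` triples."""
    start = (number - 1) * size
    return items[start:start + size]


# Equivalent of { Apache_Spark ?p ?o }: fix the subject, leave the rest open.
all_matches = list(match(TRIPLES, s="Apache_Spark"))
first_page = page(all_matches, number=1)

# Narrowing the pattern by also fixing the predicate.
languages = [o for _, _, o in match(TRIPLES, s="Apache_Spark", p="programmingLanguage")]
```

With a page size of 100, any result set of 100 or fewer triples fits on the first page, which is why the 39 matches above appear on a single page.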