ampcamp.berkeley.edu
Mini Course | UC Berkeley AMP Camp
http://ampcamp.berkeley.edu/big-data-mini-course-home
UC Berkeley AMP Camp. The UC Berkeley Big Data AMP Camp, featuring tutorials on popular open-source software including Spark, Shark, Hive, and Mesos; overviews of the Berkeley Data Analytics System (BDAS); and Machine Learning tutorials. AMP Camp aims to teach practitioners how to use the open source tools being built and released by the AMPLab. (Note: you will be billed for the EC2 time you use.) If you’re looking for the versio...
prof.ict.ac.cn
Download | BigDataBench
http://prof.ict.ac.cn/BigDataBench/dowloads
A Big Data Benchmark Suite, ICT, Chinese Academy of Sciences. Download User Manual, Technical Report and Specification: BigDataBench 3.2 User Manual [BigDataBench-UserManual]; BigDataBench JStorm User Manual [BigDataBench-JStorm-UserManual]; BigDataBench Spark Streaming User Manual [BigDataBench-SparkStreaming-UserManual]; BigDataBench 3.2 Technical Report [BigDataBench-TechnicalReport]; BigDataBench 3.2 Specification [BigDataBench-specification]. Table 1: The Summary of Data Sets. Text Generator of BDGS.
prof.ict.ac.cn
BigDataBench | A Big Data Benchmark Suite, ICT, Chinese Academy of Sciences
http://prof.ict.ac.cn/BigDataBench/old/3.0
A Big Data Benchmark Suite, ICT, Chinese Academy of Sciences. News: User group on LinkedIn. BigDataBench subset and simulator version for architecture communities released. Multi-media data sets and workloads available soon. A tutorial on BigDataBench at Micro 2014 (December 13-17, 2014, Cambridge, UK). As a multi-discipline research effort (e.g., system, architecture, and data management), BigDataBench is a big data benchmark suite (please refer to our summary paper. Of BigDataBench, which allows the fl...
jerryshao.me
Where the Traditional MapReduce Framework Is Slow
http://jerryshao.me/architecture/2013/04/15/传统的MapReduce框架慢在哪里
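This article discusses two questions. 1. Compared with Shark ... where do the advantages lie? This article is translated from Shark: SQL and Rich Analytics at Scale; the original is available at http://www.eecs.berkeley.edu/Pubs/TechRpts/2012/EECS-2012-214.pdf. Finally, we have not yet exploited the random-read capability of RDDs. Although RDDs support only coarse-grained operations for writes, reads can be fine-grained down to individual records [6], which allows an RDD to serve as an index; Tenzing can use this as a remote lookup table for join operations. To solve this problem, we propose partial DAG execution (PDE), which lets Spark alter the subsequent execution plan graph based on data statistics. PDE differs from the runtime plan rewriting of other systems (DryadLINQ) in that it can collect fine-grained statistics over key ranges, and it can completely reselect the join execution strategy, such as broadcast join, rather than merely choosing the number of Reduce tasks. [11] http://aws.amazon.com/about-aws/whats-new/2010... [12] M...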
datasalt.es
The current state of "SQL for Hadoop" - Datasalt
http://www.datasalt.es/2014/04/el-estado-actual-del-sql-para-hadoop
The current state of “SQL for Hadoop”. Written by Pere Ferrera Bertran on April 3, 2014. Since the mass adoption of Hadoop, many open-source and proprietary tools have appeared that try to solve the problem of “running queries on Hadoop”. Hadoop began as a simple (though powerful) Big Data framework consisting of (1) a storage layer (HDFS, inspired by GFS. What is the connector like? How mature is it? And how efficient is it? After a few years of a strong “NoS...
decrypt.ysance.com
Decrypt » September 2014
http://decrypt.ysance.com/2014/09
The site for decrypting information technologies. Tutorial: install Varnish VMODs from sources & build a Paywall. Paywall flow & user profiling. Big Data with Hadoop: comparison of Hive, Pig, Impala, Shark & Spark. Decrypting CoreOS, Mesos & Kubernetes. Lessons learned from continuous integration with Docker/Gitlab/Jenkins. AWS cloud infrastructure vs. physical infrastructure: THE answer! In AWS cloud infrastructure vs. physical infrastructure (1/2). So I offer you a br...
workinganalytics.com
Connections – Working Analytics
http://workinganalytics.com/connections
Building analytics experience and skills. For older Meetup reports, please scroll down. Big Data = Big Business Meetup, May 29th, 2014, Sammamish, WA. The main speaker this evening, Rolfe Lindberg of Double Down Interactive, gave the most impressive and convincing presentation I have seen on BI systems this year. Underlying the story of a young company’s success and a BI function’s progression is a tale of great execution. Rolfe’s presentation is available here: BI Buildout. The next major advance was ...
unlimited-data.blogspot.com
Unlimited-Data. moved to lab.itbee.vn : May 2013
http://unlimited-data.blogspot.com/2013_05_01_archive.html
Unlimited-Data. moved to lab.itbee.vn. The entries in this blog are really interesting to me AND are selected over Internet. Friday, 31 May 2013. SQL is what’s next for Hadoop: Here’s who’s doing it. When we first began putting together the schedule for Structure: Data. Of course, Facebook began this whole movement to bring SQL database-like functionality to Hadoop when it created Hive in 2009. Hive, now an Apache project. And keep in mind that this n...