Cloud Computing What is it? “Cloud computing is a style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet” - Wikipedia
Silvio Revelli 01- 06 - 2009
Company Name
Cloud Computing vs Traditional ICT
Seen from the application side, a cloud service works like a web service: a software system designed to support interoperable machine-to-machine interaction over a network.
Your machine asks, you pay, and you get.
The resources on offer: computational power (a virtual machine), memory (virtual storage), DB (a virtual database), but also buffers, MapReduce, content distribution optimizers, and more.
Cloud Computing Why Cloud Computing? Money!
no large investments - no opportunity costs - no customers lost
Cloud Computing
Cloud computing is a business solution!
It is a device that lets the cost curve follow the needs curve.
TWITTER: social network, $1 billion market cap. Uses the Cloud for static content storage.
ANIMOTO: startup. Uses the Cloud for creating videos from images and sounds.
ELI LILLY: pharmaceutical company, $34.5 billion market cap. Used the Cloud for clustering and research.
NY TIMES: newspaper. Used the Cloud for converting 4 TB of TIFFs into PDFs.
Cloud Computing A case study:
AUGUST 2006, FOUNDED: they had an idea and source code (including the conversion algorithm).
SUNDAY, 13 APRIL 2008: they had 25,000 registered users, but these were not very active in video making. They decided to let Facebook users make a free video with their own photos.
MONDAY: they had 25,000 users.
TUESDAY: they had 50,000 users.
THURSDAY: they had 250,000 users.
Clearly, they did not buy physical server machines to support such rapid growth. They used (and still use) the computational power provided by Amazon EC2. The Cloud gives Animoto computational power on demand, switching on new server instances when required and switching them off when unused.
MAIL:
[email protected] WEBSITE: www.silviorevelli.com
Monday (14-04-2008): 35 server instances
Thursday (17-04-2008): 3500 server instances
In physical terms, this means that you can turn this: [image] into this (x1000): [image] in just 3 days!
By the way: in June 2009 Animoto was already cash-flow positive, had received $4.4M in funding, had 13 employees, and had converted more than 8 million videos.
Amazon Web Services: in the following slides we briefly cover these Amazon Web Services:
EC2 (CloudWatch & MapReduce)
S3
We won’t discuss SimpleDB, as it follows a new-school datastore paradigm (like Google BigTable) specifically designed for scalability, among other things.
Amazon EC2
You (or your automated application) can launch instances within seconds using REST or SOAP messages.
An instance can be small, medium or large. A small instance has:
1.7 GB of memory
1.0-1.2 GHz 2007 Xeon processor (single core)
160 GB of instance storage
32-bit platform
$0.10/hour
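To make the pay-per-use pricing concrete, here is a minimal Python sketch of the arithmetic. It assumes every instance is a small one billed at the $0.10/hour quoted above and that it runs for a full 24-hour day; the fleet sizes and the function names are hypothetical and chosen only for illustration.

```python
# Assumption: all instances are "small" ones at $0.10/hour, running 24 hours.
# Prices are kept in integer cents to avoid floating-point rounding.
SMALL_INSTANCE_CENTS_PER_HOUR = 10


def daily_cost_cents(instances: int, hours: int = 24) -> int:
    """Cost in cents of running `instances` small instances for `hours` hours."""
    return instances * hours * SMALL_INSTANCE_CENTS_PER_HOUR


def format_dollars(cents: int) -> str:
    """Render an amount in cents as a dollar string, e.g. 240 -> '$2.40'."""
    return f"${cents // 100}.{cents % 100:02d}"


if __name__ == "__main__":
    # Hypothetical fleet sizes: the bill grows and shrinks with the fleet.
    for fleet in (1, 100):
        print(fleet, "instance(s) for one day:", format_dollars(daily_cost_cents(fleet)))
```

With these assumptions one instance costs $2.40 per day and one hundred cost $240.00 per day, and the bill drops back as soon as the extra instances are switched off.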
You (your application) have full root control of the machine: this means that you can customize everything, including the OS. However, when you launch an EC2 instance, you don’t want a bare machine without an OS or without installed applications. So you launch an AMI (Amazon Machine Image). An AMI is somewhat like a “photo of a machine”. Amazon gives you a public archive of AMIs with a pre-installed OS (commonly Linux), pre-installed useful applications (such as Apache) and full access control (commonly using SSH).
You can also create your own AMI: after customizing your machine you can bundle it into an AMI (it’s like taking a photo). Then you must store it in your S3 bucket. Depending on the ACL, either only you or everyone can use that AMI. When you launch an instance from an AMI, you should set your firewall configuration (e.g. 80 HTTP, 22 SSH, 443 HTTPS, 3306 MySQL).
Amazon EC2 - CloudWatch
When your application is up and running on your EC2 instance, you can monitor its status (CPU, memory, delays, I/O) with the recently introduced CloudWatch. CloudWatch comes with 2 companion features:
Elastic Load Balancing (ELB)
Auto Scaling
Using Elastic Load Balancing, you can distribute incoming traffic across your Amazon EC2 instances. Elastic Load Balancing automatically scales its request processing capacity in response to incoming application traffic. Elastic Load Balancing can also detect the health of Amazon EC2 instances. When it detects unhealthy load-balanced instances, it no longer routes traffic to them and instead spreads the load across the remaining healthy instances.
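The health-aware routing described above can be illustrated with a toy balancer. This is only a sketch: the round-robin policy, the class name `RoundRobinBalancer` and the instance ids are assumptions made for the example, not a description of how Elastic Load Balancing is implemented.

```python
from typing import Dict, List


class RoundRobinBalancer:
    """Toy load balancer: rotates over instances and skips unhealthy ones."""

    def __init__(self, instance_ids: List[str]) -> None:
        self._order = list(instance_ids)
        self._health: Dict[str, bool] = {i: True for i in self._order}
        self._next = 0  # index of the next instance to try

    def mark(self, instance_id: str, healthy: bool) -> None:
        """Record the result of a (hypothetical) health check."""
        self._health[instance_id] = healthy

    def route(self) -> str:
        """Return the next healthy instance, trying each instance at most once."""
        for _ in range(len(self._order)):
            candidate = self._order[self._next]
            self._next = (self._next + 1) % len(self._order)
            if self._health[candidate]:
                return candidate
        raise RuntimeError("no healthy instances available")


if __name__ == "__main__":
    balancer = RoundRobinBalancer(["i-a", "i-b", "i-c"])
    balancer.mark("i-b", healthy=False)  # i-b fails its health check
    print([balancer.route() for _ in range(4)])
```

In the usage example the traffic alternates between `i-a` and `i-c` while `i-b` is marked unhealthy, and `i-b` rejoins the rotation once it is marked healthy again.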
Auto Scaling enables you to set conditions (triggers on average fleet CPU, memory, delays, I/O) for when you want to scale your Amazon EC2 usage up or down. When one of the conditions is met, Auto Scaling automatically applies the action you’ve defined. Auto Scaling enables your application to scale up Amazon EC2 instances seamlessly and automatically when demand spikes. Auto Scaling allows you to automatically shed unneeded Amazon EC2 instances and save money when demand subsides.
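A trigger of this kind boils down to a small decision rule. The sketch below is one possible reading, assuming a single trigger on average fleet CPU; the thresholds, the step size, the fleet limits and the names `Trigger` and `desired_capacity` are all hypothetical and are not Auto Scaling's actual parameters.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Trigger:
    """Hypothetical trigger on average fleet CPU (all values are assumptions)."""
    scale_up_above: float = 80.0    # add instances above this average CPU %
    scale_down_below: float = 20.0  # remove instances below this average CPU %
    step: int = 1                   # instances added or removed per decision
    min_instances: int = 1
    max_instances: int = 10


def desired_capacity(current: int, cpu_percent: List[float], trigger: Trigger) -> int:
    """Return the fleet size after applying the trigger to the latest CPU samples."""
    if not cpu_percent:
        return current  # no measurements, no change
    average = sum(cpu_percent) / len(cpu_percent)
    if average > trigger.scale_up_above:
        target = current + trigger.step
    elif average < trigger.scale_down_below:
        target = current - trigger.step
    else:
        target = current
    # Never leave the configured fleet-size bounds.
    return max(trigger.min_instances, min(trigger.max_instances, target))


if __name__ == "__main__":
    print(desired_capacity(2, [90.0, 95.0], Trigger()))  # demand spike
    print(desired_capacity(2, [10.0, 5.0], Trigger()))   # demand subsides
```

Clamping to a minimum and maximum fleet size is a design choice of this sketch: it keeps at least one instance serving traffic and caps the spend during a spike.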
Amazon EC2 - MapReduce
Using Amazon Elastic MapReduce, you can instantly provision as much or as little capacity as you like to perform data-intensive tasks for applications such as web indexing, data mining, log file analysis, machine learning, financial analysis, scientific simulation, and bioinformatics research. Example: starting from a corpus, we want to build the vector space model.
Let’s start from the final result. The output should be something like this: ((word,docname),TF*IDF)
In order to calculate this: ((word,docname),TF*IDF)
we need to arrive at: ((word,docname),(tf,nt,df)) and N
where:
tf = term frequency in the document
nt = total number of terms in the document
df = number of documents that contain the term
N = total number of documents in the corpus
assuming that:
TF = tf/nt
IDF = log(N/df)
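As a worked instance of the two formulas above, here is a small Python sketch. The slide does not say which logarithm base is meant, so the natural logarithm is an assumption here, and the function name `tf_idf` and the sample numbers are made up for illustration.

```python
import math


def tf_idf(tf: int, nt: int, df: int, n_docs: int) -> float:
    """TF*IDF with TF = tf/nt and IDF = log(N/df); natural log is an assumption."""
    term_frequency = tf / nt
    inverse_document_frequency = math.log(n_docs / df)
    return term_frequency * inverse_document_frequency


if __name__ == "__main__":
    # Hypothetical numbers: a word appearing 3 times in a 100-term document,
    # present in 10 of the 1000 documents of the corpus.
    print(tf_idf(tf=3, nt=100, df=10, n_docs=1000))
    # A word found in every document carries no weight: log(N/N) = 0.
    print(tf_idf(tf=5, nt=50, df=4, n_docs=4))
```

The second call shows why IDF matters: a word that occurs in every document gets weight zero however frequent it is inside a single document.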
And so we code a MapReduce job like this:
TERM FREQUENCY
Mapper
Input: (docname,contents)
Output: ((word,docname),1)
Reducer
Sums the occurrences of the word in the document
Output: ((word,docname),tf)
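The TERM FREQUENCY job can be sketched in plain Python, with the shuffle phase simulated by an in-memory dictionary. This is not the code that runs on Elastic MapReduce; the function names and the whitespace, lower-case tokenization are assumptions made for the example.

```python
from collections import defaultdict
from typing import Dict, Iterator, List, Tuple

Key = Tuple[str, str]  # (word, docname)


def tf_mapper(docname: str, contents: str) -> Iterator[Tuple[Key, int]]:
    """Emit ((word, docname), 1) for every word occurrence in the document."""
    for word in contents.lower().split():  # naive tokenization (assumption)
        yield (word, docname), 1


def tf_reducer(key: Key, ones: List[int]) -> Tuple[Key, int]:
    """Sum the ones for a (word, docname) pair to get its term frequency."""
    return key, sum(ones)


def term_frequency(corpus: Dict[str, str]) -> Dict[Key, int]:
    """Run map, shuffle (group by key) and reduce in memory."""
    grouped: Dict[Key, List[int]] = defaultdict(list)
    for docname, contents in corpus.items():
        for key, one in tf_mapper(docname, contents):
            grouped[key].append(one)
    return dict(tf_reducer(key, ones) for key, ones in grouped.items())


if __name__ == "__main__":
    print(term_frequency({"d1": "a b a", "d2": "b"}))
```

On a real cluster the grouping step is done by the framework between the map and reduce phases; the dictionary here only stands in for it.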
WORD COUNTS
Mapper
Input: ((word,docname),tf)
Output: (docname,(word,tf))
Reducer
Sums the frequencies of the individual words in the document to get nt, and feeds the original data through
Output: ((word,docname),(tf,nt))
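The WORD COUNTS job can be sketched the same way. The block below is self-contained and takes a table shaped like the output of the previous job as its input; the function names are hypothetical and the shuffle is again simulated with a dictionary.

```python
from collections import defaultdict
from typing import Dict, Iterator, List, Tuple

Key = Tuple[str, str]  # (word, docname)


def nt_mapper(key: Key, tf: int) -> Iterator[Tuple[str, Tuple[str, int]]]:
    """Re-key each record by document: (docname, (word, tf))."""
    word, docname = key
    yield docname, (word, tf)


def nt_reducer(docname: str, values: List[Tuple[str, int]]) -> Iterator[Tuple[Key, Tuple[int, int]]]:
    """Sum all tf values of a document to get nt, then re-emit every word with it."""
    nt = sum(tf for _, tf in values)
    for word, tf in values:
        yield (word, docname), (tf, nt)


def word_counts(tf_table: Dict[Key, int]) -> Dict[Key, Tuple[int, int]]:
    """Run map, shuffle (group by docname) and reduce in memory."""
    grouped: Dict[str, List[Tuple[str, int]]] = defaultdict(list)
    for key, tf in tf_table.items():
        for docname, value in nt_mapper(key, tf):
            grouped[docname].append(value)
    result: Dict[Key, Tuple[int, int]] = {}
    for docname, values in grouped.items():
        result.update(nt_reducer(docname, values))
    return result


if __name__ == "__main__":
    print(word_counts({("a", "d1"): 2, ("b", "d1"): 1, ("b", "d2"): 1}))
```

The reducer has to see all words of a document before it can emit anything, which is why the mapper switches the key from (word, docname) to docname.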
WORD FREQUENCY IN CORPUS
Mapper
Input: ((word,docname),(tf,nt))
Output: (word,(docname,tf,nt,1))
Reducer
Sums the counts for the word in the corpus to get df
Output: ((word,docname),(tf,nt,df))
That is what we needed. With one more map-only step we arrive at the vector space table: ((word,docname),TF*IDF)
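The last job and the final map-only step can be sketched as follows. As before this is an in-memory illustration under assumptions: the function names are hypothetical, the input table is a literal shaped like the output of the WORD COUNTS job, and the natural logarithm is assumed for IDF.

```python
import math
from collections import defaultdict
from typing import Dict, Iterator, List, Tuple

Key = Tuple[str, str]  # (word, docname)


def df_mapper(key: Key, value: Tuple[int, int]) -> Iterator[Tuple[str, Tuple[str, int, int, int]]]:
    """Re-key each record by word and attach a 1 to count documents."""
    word, docname = key
    tf, nt = value
    yield word, (docname, tf, nt, 1)


def df_reducer(word: str, values: List[Tuple[str, int, int, int]]) -> Iterator[Tuple[Key, Tuple[int, int, int]]]:
    """Sum the ones to get df, then re-emit every document record with it."""
    df = sum(one for _, _, _, one in values)
    for docname, tf, nt, _ in values:
        yield (word, docname), (tf, nt, df)


def corpus_frequency(table: Dict[Key, Tuple[int, int]]) -> Dict[Key, Tuple[int, int, int]]:
    """Run map, shuffle (group by word) and reduce in memory."""
    grouped: Dict[str, List[Tuple[str, int, int, int]]] = defaultdict(list)
    for key, value in table.items():
        for word, record in df_mapper(key, value):
            grouped[word].append(record)
    result: Dict[Key, Tuple[int, int, int]] = {}
    for word, records in grouped.items():
        result.update(df_reducer(word, records))
    return result


def to_tf_idf(table: Dict[Key, Tuple[int, int, int]], n_docs: int) -> Dict[Key, float]:
    """Final map-only step: TF*IDF = (tf/nt) * log(N/df), natural log assumed."""
    return {key: (tf / nt) * math.log(n_docs / df) for key, (tf, nt, df) in table.items()}


if __name__ == "__main__":
    stats = corpus_frequency({("a", "d1"): (2, 3), ("b", "d1"): (1, 3), ("b", "d2"): (1, 1)})
    print(to_tf_idf(stats, n_docs=2))
```

The final step needs no reducer because every record already carries tf, nt and df; only N has to be supplied from outside.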
With Amazon Elastic MapReduce you can run the map-reduce task with potentially unlimited computational power. This is a screenshot of the implementation of tf on AWS (wordSplitter.py): [screenshot]
The higher the number of map-reduce instances, the shorter the time to perform the entire job flow.
Amazon S3:
Write, read, and delete objects containing from 1 byte to 5 gigabytes of data each. The number of objects you can store is unlimited.
Each object is stored in a bucket and retrieved via a unique, developer-assigned key.
Objects can be made private or public, and rights can be granted to specific users.
Uses standards-based REST and SOAP interfaces designed to work with any Internet-development toolkit.
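The bucket and key model described above can be pictured with a toy in-memory class. This is only a mental model and not the S3 API: the class `ToyBucket`, its method names and the reading of "5 gigabytes" as 5 GiB are assumptions, and access rights are left out.

```python
from typing import Dict

# Assumption: "5 gigabytes" is read here as 5 GiB.
MAX_OBJECT_BYTES = 5 * 1024 ** 3


class ToyBucket:
    """In-memory stand-in for a bucket: objects are addressed by a unique key."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._objects: Dict[str, bytes] = {}

    def put(self, key: str, data: bytes) -> None:
        """Write an object; sizes outside 1 byte .. 5 GiB are rejected."""
        if not 1 <= len(data) <= MAX_OBJECT_BYTES:
            raise ValueError("object size must be between 1 byte and 5 gigabytes")
        self._objects[key] = data

    def get(self, key: str) -> bytes:
        """Read an object back by its developer-assigned key."""
        return self._objects[key]

    def delete(self, key: str) -> None:
        """Delete an object; deleting a missing key is a no-op in this sketch."""
        self._objects.pop(key, None)


if __name__ == "__main__":
    bucket = ToyBucket("my-videos")  # hypothetical bucket name
    bucket.put("2009/intro.txt", b"hello cloud")
    print(bucket.get("2009/intro.txt"))
    bucket.delete("2009/intro.txt")
```

Keys are flat strings: a key such as `2009/intro.txt` looks like a path, but in this model it is just a name inside the bucket.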
Bye! Send me comments!
[email protected] http://silviorevelli.com
“Stay hungry stay foolish” - Steve Jobs