viadeo - cost driven development

12
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Frédéric d’Alessandro - Data architect Loïc Dias Da Silva - CTO & VP Engineering Cost Driven Development chez Viadeo Tâches Spark sur EMR optimisées avec des instances SPOT

Upload: julien-simon

Post on 15-Feb-2017

39 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Viadeo - Cost Driven Development

© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Frédéric d’Alessandro - Data architect

Loïc Dias Da Silva - CTO & VP Engineering

Cost Driven Development chez ViadeoTâches Spark sur EMR optimisées avec des instances SPOT

Page 2: Viadeo - Cost Driven Development

Viadeo

SUCCESS

Business

Fournisseurs, clients,

collaborateurs,

investisseurs...

Networking

Pairs, associés, collègues,

camarades, rencontres

pro, partenaires ...

Carrière

Premier job, recherche

d’emploi, entreprenariat,

développement, ...

Page 3: Viadeo - Cost Driven Development

Migration vers le cloud AWS

Nov. 2014

Escape to Amazon thinking

Dec. 2014

First commit Infrastructure as a code

Mar. 2015

First industrialized instance on AWS cloud

Dec. 2015

Our demo environment is on AWS cloud

July 2016

Our live production environment is on AWS cloud

Dec 2016

Datacenter is closed

demo

Page 4: Viadeo - Cost Driven Development

Cost-Driven Development

- choix des instances Amazon EC2- autoscaling- optimisations Amazon RedShift- cycle de vie des données Amazon S3- utilisation d’instances SPOT

Page 5: Viadeo - Cost Driven Development

Architecture

Page 6: Viadeo - Cost Driven Development

L’analytics chez Viadeo

Page 7: Viadeo - Cost Driven Development

Le Big Data chez Viadeo

Page 8: Viadeo - Cost Driven Development

Utilisation d’instances SPOT (ex: SnowPlow)- Prix c3.8xlarge à la demande : 1.68$- Prix c3.8xlarge Spot : ~0.40$- Baisse du coût de ~ 1200$ à ~240$ par mois

Page 9: Viadeo - Cost Driven Development

pop.sh : automatisation du lancement de jobs

Page 10: Viadeo - Cost Driven Development

Usage

$ ./pop.sh --coreInstanceType=m4.large --coreInstanceCount=2 --ec2spot=true

(...)

# Pricing info = Tue Oct 11 08:00:09 CEST 2016

# Instance type = m4.large

# availabilityZone = us-west-1c

# Aws Spot Price 1 min ago = 0.016700

# Aws Spot Price 1 day ago = 0.016300

# Our max Spot Price max proposal = 0.029

# Aws fixed price = 0.14

# Emr fixed price = 0.030

# Do we take spot = 1

$

Page 11: Viadeo - Cost Driven Development

Prochaines étapes

- Amélioration de la vitesse de scaling- Passage des stacks applicatives en µ-services- Containerisation & orchestration

Page 12: Viadeo - Cost Driven Development