Scaling python webapps from 0 to 50 million users - A top-down approach

Jinal Jhaveri
Systems Architect
jinal@lolapps.com

Agenda
- Why is performance a big issue for social games?
- Architecture
- Bottlenecks and solutions
- Performance strategy
- Questions

Why is performance a big issue for social games?

- Extremely high virality
  - installs, notifications, emails, feeds, events
- Amount of time spent is high
- http://www.bestfacebookapplications.com

Bottlenecks and Solutions
- Load Balancer
- Web Server
- Web Application
- Browser

Load Balancer
- HAProxy
- Round robin
- No gzip / no file serving
- Supports IP-based / regex-based load balancing

Webserver
- Paste
- n instances (n = no. of CPUs) (10 threads each)
- Timeout (10 seconds)
- Disable Nagle's optimization
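The slide does not show how the timeout and Nagle settings were applied to Paste, so the following is only a minimal sketch at the plain socket level. The helper name `tune_connection` is hypothetical; `TCP_NODELAY` is the standard socket option that turns off Nagle's algorithm, and the 10 second timeout mirrors the value on the slide.

```python
import socket


def tune_connection(sock, timeout_seconds=10.0):
    """Apply the two webserver settings from the slide to one TCP socket.

    Hypothetical helper for illustration; not taken from Paste itself.
    """
    # Disable Nagle's algorithm so small responses are sent immediately
    # instead of waiting to be coalesced with later writes.
    sock.setsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY, 1)
    # Give up on reads and writes that stall for longer than the timeout.
    sock.settimeout(timeout_seconds)
    return sock


if __name__ == "__main__":
    s = tune_connection(socket.socket(socket.AF_INET, socket.SOCK_STREAM))
    print(s.getsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY) != 0, s.gettimeout())
    s.close()
```

Disabling Nagle trades a few more small packets for lower latency, which tends to suit short request/response traffic.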

Web Application
- Memcached to avoid DB trips
  - ORM integration
  - Compression
  - Caching non-existence
  - Lists cache

Memcache / ORM integration

    def get(self, query, ident, *args, **kwargs):
        # Check the session's identity map first.
        key = query.mapper.identity_key_from_primary_key(ident)
        obj = query.session.identity_map.get(key)
        if obj:
            return obj

        # Then try memcached.
        mkey = gen_cache_key(key[0].__name__, key[1], self.version_string)
        obj = self.mclient.get(mkey)
        if obj is None:
            # Cache miss: load from the database and cache a detached copy.
            obj = query._get(key, ident, **kwargs)
            if obj is not None:
                query.session.expunge(obj)
                self.mclient.set(mkey, obj)

        # Re-attach the cached object to the session without hitting the DB.
        if obj:
            return query.session.merge(obj, dont_load=True)
        else:
            return None

Mike Nelson

Memcached to avoid DB trips
- ORM integration
- Compression
- Caching non-existence
- Lists cache
- Best Effort caching
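The slides list "Compression", "Caching non-existence" and "Best Effort caching" without showing code for them. One way these three ideas could fit together is sketched below. Everything here is an assumption for illustration: `DictCache` is a stand-in for a memcached client with `get`/`set`, and `cached_lookup`, `_MISSING` and `load_from_db` are hypothetical names.

```python
import pickle
import zlib

# Sentinel stored in the cache to remember "this key is not in the DB".
# Assumes no real value is ever equal to this string.
_MISSING = "__missing__"


class DictCache:
    """In-memory stand-in for a memcached client (get/set only)."""

    def __init__(self):
        self.store = {}

    def get(self, key):
        return self.store.get(key)

    def set(self, key, value):
        self.store[key] = value


def cached_lookup(cache, key, load_from_db):
    """Return the value for key, using the cache on a best-effort basis."""
    try:
        raw = cache.get(key)
    except Exception:
        raw = None  # best effort: a cache failure just means a DB trip

    if raw is not None:
        value = pickle.loads(zlib.decompress(raw))
        # A cached sentinel means the DB already told us there is no row.
        return None if value == _MISSING else value

    value = load_from_db(key)
    to_store = _MISSING if value is None else value
    try:
        # Compress the pickled value to cut memory use and network time.
        cache.set(key, zlib.compress(pickle.dumps(to_store)))
    except Exception:
        pass  # best effort: never fail the request because of the cache
    return value
```

With this shape, repeated lookups for a user who does not exist hit the database once instead of on every request.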

Web Application
- Local cache
- PythonSpeed/PerformanceTips (wiki.python.org)
- Asynchronous
  - Facebook API calls
  - Log processing
  - Event tracking
- Partial rendering / json / ajax
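The deck does not say which mechanism was used to make event tracking asynchronous. As one possible sketch, a background thread fed by a queue lets the request handler return without waiting for the tracking write. `EventTracker` and its `sink` callable are hypothetical names chosen for this example.

```python
import queue
import threading


class EventTracker:
    """Hand events to a background thread so requests do not wait on them."""

    def __init__(self, sink):
        self._sink = sink            # callable that actually records an event
        self._queue = queue.Queue()
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def track(self, name, **data):
        # Called from the request path: only enqueues, never blocks on I/O.
        self._queue.put((name, data))

    def _run(self):
        while True:
            item = self._queue.get()
            if item is None:         # shutdown marker
                return
            try:
                self._sink(item)
            except Exception:
                pass                 # tracking is best effort

    def close(self):
        # Process everything queued so far, then stop the worker.
        self._queue.put(None)
        self._worker.join()


if __name__ == "__main__":
    recorded = []
    tracker = EventTracker(recorded.append)
    tracker.track("mission_started", user=1)
    tracker.close()
    print(recorded)
```

The same pattern could wrap slow Facebook API calls or log processing, at the cost of losing queued items if the process dies before they are handled.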

Browser - Best practices
- Gzip
- CDN
- Loading images in parallel
- Ajaxification
- Client side caching

Gzip

    from paste.gzipper import make_gzip_middleware
    app = make_gzip_middleware(app, global_conf, compress_level=1)

Performance strategy
- Profile, Improve, Profile, Improve, Profile, Improve
- Measure, Measure, Measure
  - Load balancer request time
  - Web server request time
  - Controller request time
  - Rendering time

Profiling - Middleware

    import time


    class TimerMiddleware(object):
        """
        Simple middleware that logs the time of each request to the
        provided logger.

        @author Brian Rue
        """

        def __init__(self, app, log, name='outer'):
            self.app = app
            self.log = log
            self.name = name

        def __call__(self, environ, start_response):
            start_time = time.time()
            try:
                return self.app(environ, start_response)
            finally:
                end_time = time.time()
                url = environ.get('PATH_INFO', '')
                if environ.get('QUERY_STRING'):
                    url += '?' + environ['QUERY_STRING']
                self.log.debug("%f %s-%s" % ((end_time - start_time), self.name, url))

Profiling

    2010-02-07 13:27:07 0.182282 mission/index /app/20/mission user: 100000270498442
    2010-02-07 13:27:07 0.105489 outer-/app/20/mission?dummyid=1
    2010-02-07 13:27:07 0.287437 battle/attack /app/19/battle/attack user: 501879126
    2010-02-07 13:27:07 0.006339 track/record_event /app/21/track/record_event user: 1163511266
    2010-02-07 13:27:07 0.032981 outer-/app/21/track/record_event?file_name=base.js&cache_key=2285001264723175
    2010-02-07 13:27:07 0.006186 track/record_event /app/19/track/record_event user: 1039662536
    2010-02-07 13:27:07 0.072400 outer-/app/19/track/record_event?file_name=base.js&cache_key=2285001265425258
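Timing logs like these become more useful once they are aggregated. The sketch below is an assumption about how that could be done, not something shown in the deck: it treats the third whitespace-separated field as the elapsed seconds and the fourth as the handler or `outer-` URL, then reports call counts and average time per name. The function name `summarize` is hypothetical; the sample lines are taken from the slide.

```python
from collections import defaultdict

SAMPLE = """\
2010-02-07 13:27:07 0.182282 mission/index /app/20/mission user: 100000270498442
2010-02-07 13:27:07 0.287437 battle/attack /app/19/battle/attack user: 501879126
2010-02-07 13:27:07 0.006339 track/record_event /app/21/track/record_event user: 1163511266
2010-02-07 13:27:07 0.006186 track/record_event /app/19/track/record_event user: 1039662536
""".splitlines()


def summarize(lines):
    """Return {name: (call_count, average_seconds)} for timing log lines."""
    totals = defaultdict(lambda: [0, 0.0])
    for line in lines:
        parts = line.split()
        if len(parts) < 4:
            continue  # not a timing line
        try:
            seconds = float(parts[2])
        except ValueError:
            continue  # third field is not a duration
        # Drop the query string so variants of one URL are grouped together.
        name = parts[3].split("?", 1)[0]
        totals[name][0] += 1
        totals[name][1] += seconds
    return {name: (count, total / count) for name, (count, total) in totals.items()}


if __name__ == "__main__":
    # Slowest handlers first: these are the candidates to improve next.
    for name, (count, avg) in sorted(summarize(SAMPLE).items(), key=lambda kv: -kv[1][1]):
        print("%-22s calls=%d avg=%.6fs" % (name, count, avg))
```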

Profiling - Repoze

    # establish the Registry for this application
    app = registry.RegistryManager(app)

    from repoze.profile.profiler import AccumulatingProfileMiddleware
    app = AccumulatingProfileMiddleware(
        app,
        log_filename='/tmp/gameprofile.log',
        cachegrind_filename='/tmp/cachegrind.out.bar',
        discard_first_request=True,
        flush_at_shutdown=True,
        path='/__profile__')

Profiling - Repoze
- ncalls: number of calls
- tottime: time spent in the given function, excluding time spent in sub-functions
- percall: tottime / ncalls
- cumtime: total time spent in this function and all sub-functions
- percall: cumtime / ncalls
- filename:lineno(function): function info
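These column names are the same ones Python's standard `cProfile` and `pstats` modules print, so a report with this layout can be reproduced without any web stack. The sketch below profiles a small made-up workload; `slow_sum`, `handler` and `profile_report` are hypothetical names used only for this example.

```python
import cProfile
import io
import pstats


def slow_sum(n):
    # Deliberately CPU-bound so it shows up in the profile.
    return sum(i * i for i in range(n))


def handler():
    # Stands in for a controller action that calls into slower code.
    return [slow_sum(10000) for _ in range(20)]


def profile_report(func, limit=5):
    """Profile one call to func and return the pstats table as text."""
    profiler = cProfile.Profile()
    profiler.runcall(func)
    out = io.StringIO()
    stats = pstats.Stats(profiler, stream=out)
    # Sort by cumtime so the most expensive call trees come first.
    stats.sort_stats("cumulative").print_stats(limit)
    return out.getvalue()


if __name__ == "__main__":
    print(profile_report(handler))
```

In the output, a function with high cumtime but low tottime is mostly waiting on the functions it calls, while high tottime points at the function's own body.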

Profiling - Dozer

    from dozer import Dozer, Logview
    app = Logview(app, config)
    app = Dozer(app)

Database
- Optimistic vs. pessimistic locking
- version_id
  - Update table set data = xyz where version = 16
- SQLAlchemy (echo, echo_pool and logger)
- Remove/rollback
- Diagram: Process1 (Val = "abc", Version: 16) and Process2 (Val = "pqr", Version: 16) against Data (Val = xyz, Version: 16)
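The version check on this slide can be tried out with the standard library's `sqlite3` module. This is a minimal sketch of optimistic locking under stated assumptions, not the SQLAlchemy setup the talk used: the table and column names, `StaleDataError`, `read_row` and `update_if_unchanged` are all hypothetical, while the values "xyz", "abc", "pqr" and version 16 follow the slide's diagram.

```python
import sqlite3


class StaleDataError(Exception):
    """Raised when the row changed after we read it."""


def read_row(conn, row_id):
    return conn.execute(
        "SELECT val, version FROM data WHERE id = ?", (row_id,)
    ).fetchone()


def update_if_unchanged(conn, row_id, new_val, expected_version):
    # The WHERE clause only matches if nobody bumped the version meanwhile.
    cur = conn.execute(
        "UPDATE data SET val = ?, version = version + 1 "
        "WHERE id = ? AND version = ?",
        (new_val, row_id, expected_version),
    )
    if cur.rowcount == 0:
        conn.rollback()
        raise StaleDataError("row %r is no longer at version %r" % (row_id, expected_version))
    conn.commit()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE data (id INTEGER PRIMARY KEY, val TEXT, version INTEGER)")
conn.execute("INSERT INTO data VALUES (1, 'xyz', 16)")
conn.commit()

# Both processes read the row while it is still at version 16.
_, version_seen_by_p1 = read_row(conn, 1)
_, version_seen_by_p2 = read_row(conn, 1)

update_if_unchanged(conn, 1, "abc", version_seen_by_p1)  # Process1 wins
try:
    update_if_unchanged(conn, 1, "pqr", version_seen_by_p2)  # Process2 is stale
    conflict = False
except StaleDataError:
    conflict = True  # Process2 would re-read the row and retry or give up
```

No lock is held between the read and the write, so readers never block; the cost is that the losing writer has to handle the conflict.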

Tornado
- Used over WSGI
- CPU and memory usage down
- Didn't do well for high response size
- Appropriate for asynchronous / realtime

Acknowledgements
- Lolapps team
  - Brian Rue, AJ Cantu, Fred Blau, Cory Virok, Justin Rosenthal, Joseph Estrada, Allen Cheung, Vivek Tatineni, Jason Kim, Vikram Adukia
- Family
