Slides from my presentation about Shopzilla's concurrency strategies to the Pasadena Java User's Group on April 26, 2010. This is essentially the same material as covered by my colleague Rodney Barlow in an earlier presentation http://www.slideshare.net/rodneypbarlow/shopzilla-on-concurrency, with a few minor tweaks.
Report
Share
Report
Share
1 of 37
More Related Content
Shopzilla On Concurrency
1. Shopzilla on Concurrency Concurrency as Shopzilla's performance building block Will Gage, Lead Software Engineer 3/2/2010
2. Agenda Introduction History of Java Concurrency Java 5 Concurrency Features Concurrency in Frameworks Concurrency @ Shopzilla Future
3. Shopzilla, Inc. - Online Shopping Network 100M impressions/day 20-29M UV’s per Month 8,000+ searches per second 100M+ Products
4. Why do we care about concurrency? Correctness Avoiding race conditions Avoiding visibility problems Avoiding “liveness” issues Performance The limits of Moore’s Law means the rise of Amdahl’s Law
5. Amdahl’s Law P = portion of your code which can be parallelized N = number of processors (Thanks, Wikipedia!)
6. History of Java Concurrency Concurrent code pre Java 1.5 was difficult and error prone. Misinformation abounded Java users need to write reliable multi-threaded software! Doug Lea's concurrent package (circa 1998) JSR-133 Java Memory Model (threads, locks, volatiles, ...) JSR-166 Concurrency Utilities Expert groups consisting of Bloch, Goetz, and Lea
7. Where is Concurrent Code Concurrent code is in our application containers Concurrent code is in the frameworks we all use Concurrent code is increasingly found in our applications
9. Immutability Immutability = a class whose instances can't be modified Eg; String, boxed primitives, BigDecimal, BigInteger Joshua Bloch's Effective Java sets forth guidelines Eliminate mutators Eliminate extensibility All fields final Exclusivity for mutable components Further, Bloch's Effective Java reminds us Immutable objects are inherently thread safe. Immutable objects require no synchronization Immutable objects can be shared freely
10. Atomic References Immune to deadlock and other liveness issues Offer nonblocking synchronization of single variables Offer lower scheduling overhead than traditional synchronization techniques Immune to deadlock and other liveness issues Effectively volatile variables with extra features Modern hardware support through compare-and-swap processor instructions
11. Atomic References – Unique ID We needed an unique ID value Unique across multiple data-centers, and silos Configuration elements prime the singleton IdGenerator for distributed uniqueness A portion is based on a time value, eg: the seconds since the start of the month An additional portion provides uniqueness within a single JVM, using an Atomic Reference
12. Atomic References – Parent Node AtomicReference's compareAndSet() Used for visibility Used to enforce data integrity constraints Note ImmutableList
13. Atomic References – Takeaways Volatiles suffice where atomic check-then-act is overkill Some atomic nonblocking algorithms involve looping for a failed compareAndSet() During high thread contention this could actually mean inefficiency Most real-world threads have far more to do than mere lock contention though
14. Locks – ReadWriteLock Allows for multiple concurrent read locks No new read locks once a write lock is placed Write lock blocks until read locks complete Lock modes; non-fair (default), fair Reentrancy Downgrading
15. Blocking Queues Acts as a thread-safe implementation of a producer / consumer pattern JMS queue, though not distributed Insertion blocks until there is space available (for bounded queues)
16. Blocking Queues – Data Publish A very large (50GB) flat-file Consumers send data to a remote grid cache Multiple queue consumers increased throughput
17. Distributed Cached Data Snapshot n number of clients n number of HTTP requests across 6 load balanced Tomcat JVMs Threads failing to acquire lock immediately start shipping data What about the thread obtaining the lock?
19. Distributed Cached Data Snapshot Distributed competition for the publishing privilege Computation of completeness Communicate completeness Other threads in other JVMs happily polling for State.DONE
20. Distributed Cached Data Snapshot Coherence replicated cache supports cluster wide key-based lock Locked objects can still be read by other cluster threads without a lock Locks are unaffected by server failure (and will failover to a backup server.) Locks are immediately released when the lock owner (client) fails. Lock timeouts (-1, 0, 1+)
21. Distributed Cached Data Snapshot Hazelcast http://www.hazelcast.com/ Open source clustering and highly scalable data distribution platform for Java Distributed data structures Queue / Topic Map, MultiMap, Set, List Lock Effectively distributed java.util.concurrent Uses TCP/IP Cluster wide ID generators Distributed executor services
22. Distributed Cached Data Snapshot Apache Zookeeper http://hadoop.apache.org/zookeeper/ Terracotta http://www.terracotta.org/
23. Concurrency in Frameworks Hibernate Core 3.5.0 CountDownLatch Need a thread to wait until some number of events have occurred Constructed with the count of the # events which must occur before release Callable, ExecutorService, ReentrantLock, AtomicReference
24. Concurrency in Frameworks Spring Framework 3.0.1 TaskExecutor Spring 2.0 supported Java 1.4 TaskExecutor did not implement Executor In Spring 3.0 TaskExecutor extends Executor TaskExecutor sees wide use within Spring framework Quartz Message Driven POJO Spring Enterprise Recipies – Josh Long, Gar Mak
25. Shopzilla's Website Concurrency Needed sub 650ms server side response time Simplify the layers Functionally separate, individually testable, loosely coupled web-services Define SLAs for individual services
26. Shopzilla's Website Concurrency How to invoke 30+ web-services and ship a page in <650ms? Concurrency! In fact our pages today ship within 250ms
28. Shopzilla's Website Concurrency Started simple Implement only the concurrency features required Concurrency isolated to pods Pods responsible for fetching data We're using simple building blocks Incremental implementation based solely on requirements Haven't seen deadlocks
29. Shopzilla's Website Concurrency Thread longevity configured at the HTTP connection level HTTPClient connectionTimeout Spring wired HTTPClient implementation Ability to add a pod to a controller
30. Shopzilla's Website Concurrency FuturePodResult implements the PodResult interface Abstracts the details of the future PodCallable types the pod and command
31. Shopzilla's Website Concurrency Need to execute pods Configurable ExecutorService Backed with a queue Naming of threads proved useful in initial testing (JMX)
32. Shopzilla's Website Concurrency Once the concept was proven, interesting feature requests materializing Product Review pod Distilled, 2 pods needed to share a single result Added ServiceInvocation concept
33. Shopzilla's Website Concurrency Pods now have access to a Service Invocation Map get() blocks on the result of the service invocation A single service invocation result can be shared between two pods
34. Shopzilla's Website Concurrency Now we were sharing results, we were done, right? Product Review information was now required in-line in the product pod Still needed the special Product Review pod too! Dependent Service Invocations
35. Shopzilla's Website Concurrency Service Invocations can now depend on results of others Dependent Callable is configured with two callbacks; A callback whose result is blocked for A callback which is invoked once the blocking result arrives
36. Future More use of distributed data structures Spring 3.0 @Async @Scheduled JSR-315 Servlets 3.0 AsyncContext More parallelism at hardware level Message passing
37. Reference Java Concurrency in Practice (Goetz) Effective Java (Bloch) Spring Enterprise Recipes (Long, Mak) http://jcp.org/en/jsr/detail?id=133 http://jcp.org/en/jsr/detail?id=166 Spring 3.0
Editor's Notes
Today I’d like to share with you Shopzilla’s redesign of our consumer site and content delivery infrastructure.
So we’ve built an architecture that we believe to be performant and scalable. How do you go about testing this? There are a lot of moving parts * Highly concurrent requests, dozens of services, resource accesses Strategy: Each service is performance tested in isolation to its SLA. Then the full stack is performance tested
Shopzilla is one of the largest and most comprehensive online shopping networks on the web through our leading comparison shopping sites Bizrate.com and Shopzilla.com and the Shopzilla Publisher Program We help shoppers find best value, for virtually anything from 1000’s retailers Across our network we serve more than 100M impressions per day to anywhere from 20-30M unique visitors searching as many as 8000x/second for more than 105M products
Amdahl’s Law: 1 / ( ( 1-P) + P/N) P = Proportion of the program that can be made parallel N = number of processors
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application tier is a mashup of data from numerous sources All network communications via HTTP; no direct database access
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching
Web application is Java 1.6, Tomcat 6 Spring MVC Custom TAL templating engine Services are JAX-RS utilizing Apache CXF framework Database Access via Hibernate with Ehcache L2 caching Oracle 10g database We’re incorporating Oracle Coherence data grid for distributed caching