Build Your Own Recommendation Engine

•

8 likes•2,589 views

Michal Malohlava's presentation on Building Your Own Recommendation Engine 03.17.16 - Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai - To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata

Recommended for you

Getting Ready to Use Redis with Apache Spark with Tague Griffith

This technical tutorial is designed to address integrating Redis with an Apache Spark deployment to increase the performance of serving complex decision models. The session starts with a quick introduction to Redis and the capabilities Redis provides. It will cover the basic data types provided by Redis and the module system. Using an ad serving use case, Griffith will look at how Redis can improve the performance and reduce the cost of using complex ML-models in production. You will be guided through the key steps of setting up and integrating Redis with Spark, including how to train a model using Spark and then load and serve it using Redis, as well as how to work with the Spark Redis module. The capabilities of the Redis Machine Learning Module (redis-ml) will also be discussed, focusing primarily on decision trees and regression (linear and logistic) with code examples to demonstrate how to use these features. By the end of the session, you should feel confident building a prototype/proof-of-concept application using Redis and Spark. You’ll understand how Redis complements Spark, and how to use Redis to serve complex, ML-models with high performance.

•by Databricks

spark summitapache spark

Cascalog at May Bay Area Hadoop User Group

Cascalog is a Clojure-based query language for Hadoop that provides a powerful and easy-to-use tool for data analysis. It allows users to write queries as regular Clojure code, offering features like joins, aggregators, functions, and sorting. Cascalog is unique in that it offers the full power of Clojure at all times by integrating queries directly into the programming language. BackType uses Cascalog for tasks like identifying influencers on social media, determining exposure to URLs, and studying engagement over time.

•by nathanmarz

clojurehadoopcascalog

Lightning-Fast Analytics for Workday Transactional Data with Pavel Hardak and...

Workday Prism Analytics enables data discovery and interactive Business Intelligence analysis for Workday customers. Workday is a “pure SaaS” company, providing a suite of Financial and HCM (Human Capital Management) apps to about 2000 companies around the world, including more than 30% from Fortune-500 list. There are significant business and technical challenges to support millions of concurrent users and hundreds of millions daily transactions. Using memory-centric graph-based architecture allowed to overcome most of these problems. As Workday grew, data transactions from existing and new customers generated vast amounts of valuable and highly sensitive data. The next big challenge was to provide in-app analytics platform, which for the multiple types of accumulated data, and also would allow using blend in external datasets. Workday users wanted it to be super-fast, but also intuitive and easy-to-use both for the financial and HR analysts and for regular, less technical users. Existing backend technologies were not a good fit, so we turned to Apache Spark. In this presentation, we will share the lessons we learned when building highly scalable multi-tenant analytics service for transactional data. We will start with the big picture and business requirements. Then describe the architecture with batch and interactive modules for data preparation, publishing, and query engine, noting the relevant Spark technologies. Then we will dive into the internals of Prism’s Query Engine, focusing on Spark SQL, DataFrames and Catalyst compiler features used. We will describe the issues we encountered while compiling and executing complex pipelines and queries, and how we use caching, sampling, and query compilation techniques to support interactive user experience. Finally, we will share the future challenges for 2018 and beyond.

•by Databricks

apache sparksparkaisummit

Engine Architecture
Variation of λ-architecture…
… with pluggable ML 
backend

API Router
REST API via Spray 
Akka Actor accepting and ﬁltering:
• user activities
• recommendation requests
Scalable via HAProxy

API Router
 
Akka Actor handles 
• POST of user activity  
• publish activity to Redis 
• update stats in Redis (quick
updates) 
• trigger recommendation
computation

Recommended for you

Building Data Pipelines with Spark and StreamSets

Big data tools such as Hadoop and Spark allow you to process data at unprecedented scale, but keeping your processing engine fed can be a challenge. Metadata in upstream sources can ‘drift’ due to infrastructure, OS and application changes, causing ETL tools and hand-coded solutions to fail. StreamSets Data Collector (SDC) is an Apache 2.0 licensed open source platform for building big data ingest pipelines that allows you to design, execute and monitor robust data flows. In this session we’ll look at how SDC’s “intent-driven” approach keeps the data flowing, with a particular focus on clustered deployment with Spark and other exciting Spark integrations in the works.

•by Pat Patterson

sparkbigdatastreamsets

Strata San Jose 2016: Scalable Ensemble Learning with H2O

This document discusses scalable ensemble learning using the H2O platform. It provides an overview of ensemble methods like bagging, boosting, and stacking. The stacking or Super Learner algorithm trains a "metalearner" to optimally combine the predictions from multiple "base learners". The H2O platform and its Ensemble package implement Super Learner and other ensemble methods for tasks like regression and classification. An R code demo is presented on training ensembles with H2O.

•by Sri Ambati

h2o.aidatadata science

Stream All Things—Patterns of Modern Data Integration with Gwen Shapira

This document discusses patterns for modern data integration using streaming data. It outlines an evolution from data warehouses to data lakes to streaming data. It then describes four key patterns: 1) Stream all things (data) in one place, 2) Keep schemas compatible and process data on, 3) Enable ridiculously parallel single message transformations, and 4) Perform streaming data enrichment to add additional context to events. Examples are provided of using Apache Kafka and Kafka Connect to implement these patterns for a large hotel chain integrating various data sources and performing real-time analytics on customer events.

•by Databricks

apache sparkspark summit

API Router
 
Akka Actor handles
• GET recommendation request 
• fetch pre-computed
recommendation from Redis if exists 
• OR try to do best-effort to provide
“coldstart" recommendation based
on history of user activities

Redis Store
Redis is used as 
• events bus:
• inform subscribers about user
activities
• requests to provide new
recommendation for user 
• data storage
• old/new recommendations
• statistics (likes/swipe per user)
• simple persistence model 
• computation engine
• keep top-N artists, top-N songs per user

ML Backend
Language/technology agnostic
• Needs to be ﬂexible enough to prototype
different strategies
“Runners” for
• generating recommendations 
with H2O and Python
• collecting/generating statistics
• clustering users with H2O JVM
“Runners” are subscribed to Redis/
processing Redis data

ML Backend
Final strategy
• identify user cluster based on  
users activities (aka music styles)
• apply different recommendation 
strategies inside each cluster
• identify “weird” users (~outliers)
• adapt recommendation for them
• needs manual intervention/algorithm
tuning

Recommended for you

Realtime streaming architecture in INFINARIO

About our experience with realtime analyses on never-ending stream of user events. Discuss Lambda architecture, Kappa, Apache Kafka and our own approach.

•by Jozo Kovac

analyticsstreamingreal-time

Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan Volz

This document discusses using Hadoop for archiving, e-discovery, and supervision. It outlines the key components of each task and highlights traditional shortcomings. Hadoop provides strengths like speed, ease of use, and security. An architectural overview shows how Hadoop can be used for ingestion, processing, analysis, and machine learning. Examples demonstrate surveillance use cases. While some obstacles remain, partners can help address areas like user interfaces and compliance storage.

•by Databricks

apache sparkspark summit

Deduplication and Author-Disambiguation of Streaming Records via Supervised M...

Here we present a general supervised framework for record deduplication and author-disambiguation via Spark. This work differentiates itself by – Application of Databricks and AWS makes this a scalable implementation. Compute resources are comparably lower than traditional legacy technology using big boxes 24/7. Scalability is crucial as Elsevier’s Scopus data, the biggest scientific abstract repository, covers roughly 250 million authorships from 70 million abstracts covering a few hundred years. – We create a fingerprint for each content by deep learning and/or word2vec algorithms to expedite pairwise similarity calculation. These encoders substantially reduce compute time while maintaining semantic similarity (unlike traditional TFIDF or predefined taxonomies). We will briefly discuss how to optimize word2vec training with high parallelization. Moreover, we show how these encoders can be used to derive a standard representation for all our entities namely such as documents, authors, users, journals, etc. This standard representation can simplify the recommendation problem into a pairwise similarity search and hence it can offer a basic recommender for cross-product applications where we may not have a dedicate recommender engine designed. – Traditional author-disambiguation or record deduplication algorithms are batch-processing with small to no training data. However, we have roughly 25 million authorships that are manually curated or corrected upon user feedback. Hence, it is crucial to maintain historical profiles and hence we have developed a machine learning implementation to deal with data streams and process them in mini batches or one document at a time. We will discuss how to measure the accuracy of such a system, how to tune it and how to process the raw data of pairwise similarity function into final clusters. Lessons learned from this talk can help all sort of companies where they want to integrate their data or deduplicate their user/customer/product databases.

•by Spark Summit

apache sparkspark summit

Results
• Single machine for API Router and Redis
• peeks 50 activities/sec, avg 10 activities/sec
• small memory footprint
• ML Runners spread over EC2 machines
• even simple but different strategies for each user sectors
and selected individual users provides surprisingly good
results

Learn more at h2o.ai
Follow us at @h2oai
Thank you!

What's hot

A Predictive Analytics Workflow on DICOM Images using Apache Spark with Anahi...

Databricks

This document discusses using Apache Spark for predictive analytics on DICOM medical images. It describes challenges in working with medical image and metadata, and how tools like Spark, Spark-TK, and TensorFlow can help analyze this data at scale. A live demo then shows building machine learning models on DICOM data to derive insights and predict patient outcomes or device performance. Performance tests analyze the impact of data size, partitions, executors, and cores on processing time.

From R Script to Production Using rsparkling with Navdeep Gill

Databricks

The rsparkling R package is an extension package for sparklyr (an R interface for Apache Spark) that creates an R front-end for the Sparkling Water Spark package from H2O. This provides an interface to H2O’s high performance, distributed machine learning algorithms on Spark, using R. The main purpose of this package is to provide a connector between sparklyr and H2O’s machine learning algorithms. In this session, Gill will introduce the basic architectures of rsparkling, H2O Sparkling Water and sparklyr, and go over how these frameworks work together to build a cohesive machine learning framework. In addition, you’ll learn about various implementations for using rsparkling in production. The session will conclude with a live demo of rsparkling that will display an end-to-end use case of data ingestion, munging and machine learning.

Disrupting Big Data with Apache Spark in the Cloud

Jen Aman

This document discusses the challenges of big data analytics and how Apache Spark and Databricks can help address them. It summarizes that: 1) There is a gap between the growth of data and ability to perform real-time analytics on that data due to challenges in managing infrastructure, empowering teams, and establishing production-ready applications. 2) Databricks provides a cloud-hosted platform that uses Apache Spark to allow for just-in-time processing of data across storage silos, with an integrated workspace for interactive exploration, machine learning, and production-ready workflows. 3) Databricks Enterprise Security provides an end-to-end security solution for Apache Spark to address challenges in securing file

Getting Ready to Use Redis with Apache Spark with Tague Griffith

Databricks

Cascalog at May Bay Area Hadoop User Group

nathanmarz

Lightning-Fast Analytics for Workday Transactional Data with Pavel Hardak and...

Databricks

Building Data Pipelines with Spark and StreamSets

Pat Patterson

Strata San Jose 2016: Scalable Ensemble Learning with H2O

Sri Ambati

Stream All Things—Patterns of Modern Data Integration with Gwen Shapira

Databricks

Realtime streaming architecture in INFINARIO

Jozo Kovac

Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan Volz

Databricks

Deduplication and Author-Disambiguation of Streaming Records via Supervised M...

Spark Summit

Insights Without Tradeoffs Using Structured Streaming keynote by Michael Armb...

Spark Summit

Real-Time Analytics and Actions Across Large Data Sets with Apache Spark

Databricks

Around the world, businesses are turning to AI to transform the way they operate and serve their customers. But before they can implement these technologies, companies must address the roadblock of moving from batch analytics to making real-time decisions by rapidly accessing and analyzing the relevant information amidst a sea of data. Yaron will explain how to make Spark handle multivariate real-time, historical and event data simultaneously to provide immediate and intelligent responses. He will present several time sensitive use-cases including fraud detection, prevention of outages and customer recommendations to demonstrate how to perform predictive analytics and real-time actions with Spark. Speaker: Yaron Ekshtein

Productionizing H2O Models with Apache Spark with Jakub Hava and Michal Maloh...

Databricks

Spark pipelines represent a powerful concept to support productionizing machine learning workflows. Their API allows to combine data processing with machine learning algorithms and opens opportunities for integration with various machine learning libraries. However, to benefit from the power of pipelines, their users need to have a freedom to choose and experiment with any machine learning algorithm or library. Therefore, we developed Sparkling Water that embeds H2O machine learning library of advanced algorithms into the Spark ecosystem and exposes them via pipeline API. Furthermore, the algorithms benefit from H2O MOJOs – Model Object Optimized – a powerful concept shared across entire H2O platform to store and exchange models. The MOJOs are designed for effective model deployment with focus on scoring speed, traceability, exchangeability, and backward compatibility. In this talk we will explain the architecture of Sparkling Water with focus on integration into the Spark pipelines and MOJOs. We’ll demonstrate creation of pipelines integrating H2O machine learning models and their deployments using Scala or Python. Furthermore, we will show how to utilize pre-trained model MOJOs with Spark pipelines.

ASPgems - kappa architecture

Juantomás García Molina

Kappa Architecture is an alternative to Lambda Architecture that simplifies real-time data processing. It uses a distributed log like Kafka to store all input data immutably to allow reprocessing from the beginning if the processing code changes. This avoids having to maintain separate batch and real-time processing systems. The ASPgems team has implemented Kappa Architecture for several clients using Kafka, Spark Streaming, and Cassandra to provide real-time analytics and metrics in sectors like telecommunications, IoT, insurance, and energy.

Shifting Data Science into High Gear

Spark Summit

Rob Thomas discusses IBM's investments in Apache Spark and the IBM Data Science Experience. IBM is a major contributor to Spark and has introduced tools like SparkSQL and Stocator. The presentation also introduces the IBM Data Science Experience, an analytics IDE built on Spark that provides learning resources, project sharing capabilities, and community features to enable collaboration. Thomas explains how IBM is growing the ecosystem around the Data Science Experience through deep integrations with IBM tools and light integrations with independent software vendors.

Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...

Databricks

Here we present a real-time, scalable online fraud detection solution backed by deep learning technique. Nowadays, most deep learning applications are seen in actively studied fields including computer vision, natural language processing, etc. Our current solution represents one of the few production examples where deep learning models are applied to security problems. Our results demonstrate that deep learning solution outforms traditional blacklist and machine learning approaches significantly at terabyte-data scale. Online fraud is largely orchestrated by organized crime rings. Coordinated malicious user accounts, either created anew, or obtained via user hijacking, actively target the various modern online service for real-world financial gain. Existing fraud solutions either rely on reputation lists for blocking known suspicious activities, or require extensive feature engineering by human analysts for model training. These approaches do not adapt well to changing fraud patterns nor are able to scale to large data volumes. At DataVisor, we analyze activities from billions of accounts across global online services to detect fraud and abuse. These data gives us unique insights into the online fraud landscape that allow us to tackle the coordinated fraud attacks holistically. Our deep learning solution is based on digital information commonly collected by online services, including IP addresses, user-agent strings, email domains, user nicknames, etc. We build a general fraud detection framework which can identify fraudulent activities in log data that contain (all or a subnet of) these common digital information. By leveraging common digital information, the model is agnostic to the specific application or service from which data queries originate. We discuss the design and implementation of our deep learning pipeline based on Spark and Tensorflow that is built to fit our multi-cloud, real-time production requirements. We also demonstrate how our system outperforms traditional solutions including blacklists and machine learning methods.

Open Source DataViz with Apache Superset

Carl W. Handlin

This document introduces Apache Superset, an open source data exploration and visualization tool. Superset allows users to easily slice, dice and visualize data without coding knowledge. It was originally developed by engineers at Airbnb and is now maintained under the Apache license. Some key features include supporting multiple data sources, interactivity without coding, and being free to use. While still developing, Superset provides an open alternative to paid business intelligence tools.

Online Model Updating with Spark Streaming

Keira Zhou

What's hot (20)

A Predictive Analytics Workflow on DICOM Images using Apache Spark with Anahi...

From R Script to Production Using rsparkling with Navdeep Gill

Disrupting Big Data with Apache Spark in the Cloud

Getting Ready to Use Redis with Apache Spark with Tague Griffith

Cascalog at May Bay Area Hadoop User Group

Lightning-Fast Analytics for Workday Transactional Data with Pavel Hardak and...

Building Data Pipelines with Spark and StreamSets

Strata San Jose 2016: Scalable Ensemble Learning with H2O

Stream All Things—Patterns of Modern Data Integration with Gwen Shapira

Realtime streaming architecture in INFINARIO

Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan Volz

Deduplication and Author-Disambiguation of Streaming Records via Supervised M...

Insights Without Tradeoffs Using Structured Streaming keynote by Michael Armb...

Real-Time Analytics and Actions Across Large Data Sets with Apache Spark

Productionizing H2O Models with Apache Spark with Jakub Hava and Michal Maloh...

ASPgems - kappa architecture

Shifting Data Science into High Gear

Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...

Open Source DataViz with Apache Superset

Online Model Updating with Spark Streaming

Viewers also liked

H2O Big Join Slides

Sri Ambati

NLP with H2O

Sri Ambati

The document outlines an agenda for a talk on natural language processing and machine learning using H2O. It discusses word embedding algorithms like Word2Vec, how to use Word2Vec in H2O to generate embeddings, and how embeddings can be used as features for supervised learning tasks. It then demonstrates building a movie genre classification model in H2O using Word2Vec embeddings, examining results, and techniques for improving performance like adding new features, early stopping, and grid search.

Intro to Machine Learning with H2O and AWS

Sri Ambati

Anti-Money Laundering Solution

Sri Ambati

In this talk, Ashrith will be introducing you to the idea of using machine learning for detecting money laundering. The idea behind using ML for detecting money laundering is that the current rules-based engine have limited visibility into money movement. And as models learn the nuances of money movement, especially illegal, much better money laundering detection is possible. Bio: Ashrith Barthur is a Security Scientist at H2O currently working on algorithms that detect anomalous behaviour in user activities, network traffic, attacks, financial fraud and global money movement. He has a PhD from Purdue University in the field of information security, specialized in Anomalous behaviour in DNS protocol. https://www.linkedin.com/in/abarthur/

Intro to Machine Learning for GPUs

Sri Ambati

1. The document discusses GPUs and their advantages for machine learning tasks like deep learning and parallel computing. GPUs have many parallel processors that can accelerate matrix multiplications and other computations used in machine learning algorithms. 2. It introduces CUDA and how it allows GPUs to be programmed for general purpose processing through a parallel computing model. Examples are given of how matrix multiplications and convolutional neural network operations can be parallelized on GPUs. 3. H2O is presented as a machine learning platform that supports GPU acceleration for algorithms like gradient boosted machines, enabling faster training on large datasets. Instructions are provided on getting started with CUDA, cuDNN and using GPUs for machine learning.

Large-Scale Training with GPUs at Facebook

Faisal Siddiqi

This document discusses large-scale distributed training with GPUs at Facebook using their Caffe2 framework. It describes how Facebook was able to train the ResNet-50 model on the ImageNet dataset in just 1 hour using 32 GPUs with 8 GPUs each. It explains how synchronous SGD was implemented in Caffe2 using Gloo for efficient all-reduce operations. Linear scaling of the learning rate with increased batch size was found to work best when gradually warming up the learning rate over the first few epochs. Nearly linear speedup was achieved using this approach on commodity hardware.

Parameter Server Approach for Online Learning at Twitter

Zhiyong (Joe) Xie

Parameter Server approaches for online learning at Twitter allow models to be updated continuously based on new data and improve predictions in real-time. Version 1.0 decouples training and prediction to increase efficiency. Version 2.0 scales training by distributing it across servers. Version 3.0 will scale large complex models by sharding models and features across multiple servers. These approaches enable Twitter to perform online learning on massive datasets and complex models in real-time.

2017 10-10 (netflix ml platform meetup) learning item and user representation...

Ed Chi

1) Learning user and item representations is challenging due to sparse data and shifting preferences in recommender systems. 2) The presentation outlines research at Google to address sparsity through two approaches: focused learning, which develops specialized models for subsets of data like genres or cold-start items, and factorized deep retrieval, which jointly embeds items and their features to predict preferences for fresh items. 3) The techniques have improved overall viewership and nomination of candidates, demonstrating their effectiveness in production recommender systems.

Horovod - Distributed TensorFlow Made Easy

Alexander Sergeev

Viewers also liked (9)

H2O Big Join Slides

NLP with H2O

Intro to Machine Learning with H2O and AWS

Anti-Money Laundering Solution

Intro to Machine Learning for GPUs

Large-Scale Training with GPUs at Facebook

Parameter Server Approach for Online Learning at Twitter

2017 10-10 (netflix ml platform meetup) learning item and user representation...

Horovod - Distributed TensorFlow Made Easy

Similar to Build Your Own Recommendation Engine

Webdistilled API

Vieri Emiliani

The document provides an overview of the Webdistilled ecosystem and API. It describes the large number of data sources in multiple languages and countries that are analyzed. It then summarizes the various API services for user management, projects, settings, selection queries, integration, tags, clips, views, agents, and analysis that allow accessing and interacting with the Webdistilled data and applications. It concludes by describing the API explorer tool that can be used to test the API methods.

Kinesis @ lyft

Mian Hamid

This talk focuses on how we used Amazon Kinesis to build the pub-sub infra at Lyft, that ingests more than a 100 billion events per day. We'll review the strengths and weaknesses of Kinesis as a choice for streaming events in realtime, at Lyft's scale; as well as the best practices and lessons learnt over time. Speaker: Hafiz Hamid (Lyft) Hafiz Hamid is a software engineer on the Pub-Sub/Streaming Platform team at Lyft. He has built some of the key pieces in the messaging & streaming infrastructure at Lyft. Previously, Hafiz was a technical lead at Bing Search where he worked on data pipelines, relevance and web crawlers.

Your API is Bad and You Should Feel Bad

Amanda Folson

Music streams

Stefano Galarraga

Crowdmix is a social network focused on music that runs on an event-driven architecture. It allows users to share music content from various streaming services across crowds. The system is designed for scalability to support millions of users. Key components include Kafka for event streaming, Spark/EMR for batch processing, Elasticsearch for search, and Cassandra/Redshift for materialized views and analytics. Music matching is required to identify tracks across different services. Performance testing showed the architecture could support the targeted user scale.

REST - Why, When and How? at AMIS25

Jon Petter Hjulstad

This document provides an overview of REST APIs and discusses why REST is commonly preferred over SOAP. It describes various REST API description languages (ADLs) like Swagger, RAML, and WADL and compares their support in Oracle products. It also provides examples of describing a sample Norwegian dataset API in RAML and implementing REST support in SOA Suite, including creating WADLs from other ADLs or using the REST adapter. The document concludes with discussing REST support in Java EE and Oracle PaaS products.

Building Scalable Big Data Infrastructure Using Open Source Software Presenta...

ssuserd3a367

1) StumbleUpon uses open source tools like Kafka, HBase, Hive and Pig to build a scalable big data infrastructure to process large amounts of data from its services in real-time and batch. 2) Data is collected from various services using Kafka and stored in HBase for real-time analytics. Batch processing is done using Pig and data is loaded into Hive for ad-hoc querying. 3) The infrastructure powers various applications like recommendations, ads and business intelligence dashboards.

Design & Deploy a data-driven Web API in 2 hours

Restlet

WSO2 ESB Integration with REST

WSO2

WSO2 provides an open source integration platform that enables organizations to expose existing services and applications through RESTful APIs. The platform uses the Apache Synapse ESB at its core to provide mediation capabilities. RESTful APIs in WSO2 ESB allow resources to be exposed over HTTP and dispatched based on URL patterns and HTTP verbs. This allows for building and consuming RESTful services and integrations.

SharePoint Saturday The Conference 2011 - SP2010 Performance

Brian Culver

Is your farm struggling to server your organization? How long is it taking between page requests? Where is your bottleneck in your farm? Is your SQL Server tuned properly? Worried about upgrading due to poor performance? We will look at various tools for analyzing and measuring performance of your farm. We will look at simple SharePoint and IIS configuration options to instantly improve performance. I will discuss advanced approaches for analyzing, measuring and implementing optimizations in your farm.

Role of Rest vs. Web Services and EI

WSO2

REST is a lightweight architecture for building client-server applications. It uses standard HTTP methods to allow requesting and modifying resource state representations. While SOAP and web services will continue to be used, REST is better suited for mobile and web applications. Organizations are realizing they cannot replace existing technologies and instead focus on integrating technologies to leverage their respective strengths. Exposing existing systems through a REST API gateway allows for coexistence while providing a clean interface. Security, caching, throttling and monitoring are important when managing REST APIs at an enterprise scale.

Social Architecture of SharePoint 2013 for Developers

Paul J. Swider

The document provides an overview of the social architecture and features in SharePoint, including community sites, profile pages, following, and tasks. It discusses the APIs and programming models for extending social features. Key points include how activities are stored in the microfeed list and social list, using the social actor and person properties objects, and the REST APIs for following, feeds, and the people picker. It notes some deprecated social features from previous versions of SharePoint.

Aesop change data propagation

Regunath B

1) Aesop is an open source change data capture and propagation tool that reliably captures changes from data sources and propagates them to other data stores and systems to enable eventual consistency across polyglot data platforms. 2) It uses log mining to capture changes from data sources like MySQL and propagates the change events to consumers like ElasticSearch and HBase through an enhanced relay component. 3) It provides utilities for bootstrapping consumers, monitoring and administering the system, and has been used in production at Flipkart for applications including payments, ETL, and data serving.

Lantea platform

Neuzilla

Lantea is an open source big data platform for .NET that allows easy extraction, transformation, and loading of data from various sources. It features SQL querying of aggregated data, simple data collection from websites, files, emails and databases, and export of data in multiple formats and APIs. Lantea is targeted towards data scientists, market analysts, managers needing business intelligence, researchers, and big data developers.

Big ideas in small packages - How microservices helped us to scale our vision

Sebastian Schleicher

Blinkist started as a monolith application but transitioned to microservices as the business scaled. This allowed each part of the application to scale independently and improved development speed. Key learnings included only migrating existing features if it provided value, embracing AWS vendor lock-in while acting wisely, and building new features as microservices adhering to standards like JSON API. The monolith validated their product vision while microservices helped scale the business through a stable, automated infrastructure.

SharePoint 2013 - What's New

AdventosConsulting

Hadoop in the Cloud: Common Architectural Patterns

DataWorks Summit

The document discusses how companies are using Microsoft Azure services like HDInsight, Data Factory, Machine Learning, and others to gain insights from large volumes of data. Specifically, it provides examples of: 1) A large computer manufacturer/retailer analyzing clickstream data with HDInsight to understand customer behavior and provide real-time recommendations to increase online conversions. 2) An industrial automation company partnering with an oil company to use IoT sensors and analytics to monitor LNG fueling stations for proactive maintenance based on sensor data analyzed with HDInsight, Data Factory, and Machine Learning. 3) How data from various industries like retail, oil and gas, manufacturing, and others can be analyzed

REST APIs

Arthur De Magalhaes

This document discusses RESTful microservices and best practices for designing REST APIs. It covers topics like why REST is important for API design, common REST principles, naming conventions, resource relationships, security, versioning, documentation, and management of REST APIs. It also provides examples of how various companies implement practices like filtering, searching, paging, and error handling in their REST APIs. Finally, it discusses how the WebSphere Liberty application server supports REST APIs through features like API discovery and collective APIs.

SharePoint Saturday San Antonio: SharePoint 2010 Performance

Brian Culver

CCI2018 - Real-time dashboard whatif analysis

walk2talk srl

Marco Pozzan Power BI consultant & Trainer Scenario di utilizzo del real-time di Power BI. In questa sessione verrà introdotta la teoria sul real-time dashboarding offerto da Power BI. Poi ci si focalizzerà sun un caso pratico di real-time dataset in modalità ibrida per la realizzazione di una dashboard di controllo con la possibilità di effettuare il write back e permettere all’utente di effettuare analisi what-if.

Solr and ElasticSearch demo and speaker feb 2014

nkabra

The document provides an overview of distributed database architecture and search technologies. It discusses Solr and ElasticSearch, including their history, key features, use cases, and migration process. A presentation is given covering basics, current usage, highlights, and taking questions. Examples are provided of companies using ElasticSearch for applications like resume recommendations, integration, and searching large collections of documents.

Similar to Build Your Own Recommendation Engine (20)

Webdistilled API

Kinesis @ lyft

Your API is Bad and You Should Feel Bad

Music streams

REST - Why, When and How? at AMIS25

Building Scalable Big Data Infrastructure Using Open Source Software Presenta...

Design & Deploy a data-driven Web API in 2 hours

WSO2 ESB Integration with REST

SharePoint Saturday The Conference 2011 - SP2010 Performance

Role of Rest vs. Web Services and EI

Social Architecture of SharePoint 2013 for Developers

Aesop change data propagation

Lantea platform

Big ideas in small packages - How microservices helped us to scale our vision

SharePoint 2013 - What's New

Hadoop in the Cloud: Common Architectural Patterns

REST APIs

SharePoint Saturday San Antonio: SharePoint 2010 Performance

CCI2018 - Real-time dashboard whatif analysis

Solr and ElasticSearch demo and speaker feb 2014

More from Sri Ambati

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...

Sri Ambati

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day

Sri Ambati

This document provides an overview of H2O.ai, an AI company that offers products and services to democratize AI. It mentions that H2O products are backed by 10% of the world's top data scientists from Kaggle and that H2O has customers in 7 of the top 10 banks, 4 of the top 10 insurance companies, and top manufacturing companies. It also provides details on H2O's founders, funding, customers, products, and vision to make AI accessible to more organizations.

Generative AI Masterclass - Model Risk Management.pptx

Sri Ambati

Here are some key points about benchmarking and evaluating generative AI models like large language models: - Foundation models require large, diverse datasets to be trained on in order to learn broad language skills and knowledge. Fine-tuning can then improve performance on specific tasks. - Popular benchmarks evaluate models on tasks involving things like commonsense reasoning, mathematics, science questions, generating truthful vs false responses, and more. This helps identify model capabilities and limitations. - Custom benchmarks can also be designed using tools like Eval Studio to systematically test models on specific applications or scenarios. Both automated and human evaluations are important. - Leaderboards like HELM aggregate benchmark results to compare how different models perform across a wide range of tests and metrics.

AI and the Future of Software Development: A Sneak Peek

Sri Ambati

LLMOps: Match report from the top of the 5th

Sri Ambati

The document discusses LLMOps (Large Language Model Operations) compared to traditional MLOps. Some key points: - LLMOps and MLOps face similar challenges across the development lifecycle, but LLMOps requires more GPU resources and integration is faster due to more models in each application. Evaluation is also less clear. - The LLMOps field is around the 5th generation of models, with debates around proprietary vs open source models, and balancing privacy, cost and control. - LLMOps platforms are emerging to provide solutions for tasks like prompting, embedding databases, evaluation, and governance, similar to how MLOps platforms have evolved.

Building, Evaluating, and Optimizing your RAG App for Production

Sri Ambati

The document discusses optimizing question answering systems called RAG (Retrieve-and-Generate) stacks. It outlines challenges with naive RAG approaches and proposes solutions like improved data representations, advanced retrieval techniques, and fine-tuning large language models. Table stakes optimizations include tuning chunk sizes, prompt engineering, and customizing LLMs. More advanced techniques involve small-to-big retrieval, multi-document agents, embedding fine-tuning, and LLM fine-tuning.

Building LLM Solutions using Open Source and Closed Source Solutions in Coher...

Sri Ambati

Sandeep Singh, Head of Applied AI Computer Vision, Beans.ai H2O Open Source GenAI World SF 2023 In the modern era of machine learning, leveraging both open-source and closed-source solutions has become paramount for achieving cutting-edge results. This talk delves into the intricacies of seamlessly integrating open-source Large Language Model (LLM) solutions like Vicuna, Falcon, and Llama with industry giants such as ChatGPT and Google's Palm. As the demand for fine-tuned and specialized datasets grows, it is imperative to understand the synergy between these tools. Attendees will gain insights into best practices for building and enriching datasets tailored for fine-tuning tasks, ensuring that their LLM projects are both robust and efficient. Through real-world examples and hands-on demonstrations, this talk will equip attendees with the knowledge to harness the power of both open and closed-source tools in a coherent and effective manner.

Risk Management for LLMs

Sri Ambati

Patrick Hall, Professor, AI Risk Management, The George Washington University H2O Open Source GenAI World SF 2023 Language models are incredible engineering breakthroughs but require auditing and risk management before productization. These systems raise concerns about toxicity, transparency and reproducibility, intellectual property licensing and ownership, disinformation and misinformation, supply chains, and more. How can your organization leverage these new tools without taking on undue or unknown risks? While language models and associated risk management are in their infancy, a small number of best practices in governance and risk are starting to emerge. If you have a language model use case in mind, want to understand your risks, and do something about them, this presentation is for you!

Open-Source AI: Community is the Way

Sri Ambati

Dr. Alexy Khrabrov, Open Source Science Community Director, IBM H2O Open Source GenAI World SF 2023 In this talk, Dr. Alexy Khrabrov, recently elected Chair of the new Generative AI Commons at Linux Foundation for AI & Data, outlines the OSS AI landscape, challenges, and opportunities. With new models and frameworks being unveiled weekly, one thing remains constant: community building and validation of all aspects of AI is key to reliable and responsible AI we can use for business and society needs. Industrial AI is one key area where such community validation can prove invaluable.

Building Custom GenAI Apps at H2O

Sri Ambati

The document announces the launch of the H2O GenAI App Store, which provides a collection of applications that make it easier for average users to leverage large language models through custom interfaces for specific tasks like getting gardening advice or feedback on code. The app store is designed to accelerate the development of these GenAI apps using the H2O Wave platform and provides access to H2OGPTE for retrieval augmented generation and language model calls. Developers can also contribute their own apps through the GitHub repository listed.

Applied Gen AI for the Finance Vertical

Sri Ambati

Megan Kurka, Vice President, Customer Data Scientist, H2O.ai H2O Open Source GenAI World SF 2023 Discover the transformative power of Applied Gen AI. Learn how the H2O team builds customized applications and workflows that integrate capabilities of Gen AI and AutoML specifically designed to address and enhance financial use cases. Explore real world examples, learn best practices, and witness firsthand how our innovative solutions are reshaping the landscape of finance technology.

Cutting Edge Tricks from LLM Papers

Sri Ambati

This document discusses techniques for improving language models (LLMs) discussed in recent papers. It describes building blocks of LLMs like fine-tuning, foundation training, memory, and databases. Specific techniques covered include LIMA which uses 1,000 carefully curated examples, instruction backtranslation to generate question-answer pairs, fine-tuning models on API examples like Gorilla, and reducing false answers through techniques like not agreeing with incorrect user opinions. The goal is to discuss cutting edge tricks to build better LLMs.

Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...

Sri Ambati

Pascal Pfeiffer, Principal Data Scientist, H2O.ai H2O Open Source GenAI World SF 2023 This talk dives into the expansive ecosystem of Large Language Models (LLMs), offering practitioners an insightful guide to various relevant applications, from natural language understanding to creative content generation. While exploring use cases across different industries, it also honestly addresses the current limitations of LLMs and anticipates future advancements.

Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...

Sri Ambati

KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...

Sri Ambati

This document discusses using large language models (LLMs) for text classification tasks. It begins by describing how LLMs are commonly used for text generation and question answering. For classification, models are usually trained supervised on labeled data. The document then explores using LLMs for zero-shot classification without training, and techniques like fine-tuning LLMs on tasks to improve performance. It provides an example of fine-tuning an LLM on a financial sentiment dataset. The document concludes by describing H2O.ai's LLM Studio tool for fine-tuning and a few Kaggle competitions where LLMs achieved success in text classification.

LLM Interpretability

Sri Ambati

1) Generative AI (GenAI) enables the creation of novel content by learning patterns in unstructured data rather than labeling outputs like traditional AI. 2) Both traditional and generative AI models lack transparency and may contain biases, but generative models can additionally hallucinate or leak private information. 3) To interpret generative models, researchers evaluate accuracy globally by checking for hallucinations or undesirable content, and locally by confirming the quality of individual responses.

Never Reply to an Email Again

Sri Ambati

Introducción al Aprendizaje Automatico con H2O-3 (1)

Sri Ambati

From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...

Sri Ambati

Numerai is an open, crowd-sourced hedge fund powered by predictions from data scientists around the world. In return, participants are rewarded with weekly payouts in crypto. In this talk, Joe will give an overview of the Numerai tournament based on his own experience. He will then explain how he automates the time-consuming tasks such as testing different modelling strategies, scoring new datasets, submitting predictions to Numerai as well as monitoring model performance with H2O Driverless AI and R.

AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...

Sri Ambati

In this session, you will learn about what you should do after you’ve taken an AI transformation baseline. Over the span of this session, we will discuss the next steps in moving toward AI readiness through alignment of talent and tools to drive successful adoption and continuous use within an organization. To find additional videos on AI courses, earn badges, join the courses at H2O.ai Learning Center: https://training.h2o.ai/products/ai-foundations-course To find the Youtube video about this presentation: https://youtu.be/K1Cl3x3rd8g Speaker: Chemere Davis (H2O.ai - Senior Data Scientist Training Specialist)

More from Sri Ambati (20)

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day

Generative AI Masterclass - Model Risk Management.pptx

AI and the Future of Software Development: A Sneak Peek

LLMOps: Match report from the top of the 5th

Building, Evaluating, and Optimizing your RAG App for Production

Building LLM Solutions using Open Source and Closed Source Solutions in Coher...

Risk Management for LLMs

Open-Source AI: Community is the Way

Building Custom GenAI Apps at H2O

Applied Gen AI for the Finance Vertical

Cutting Edge Tricks from LLM Papers

Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...

Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...

KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...

LLM Interpretability

Never Reply to an Email Again

Introducción al Aprendizaje Automatico con H2O-3 (1)

From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...

AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...

Recently uploaded

Best Programming Language for Civil Engineers

Awais Yaseen

The integration of programming into civil engineering is transforming the industry. We can design complex infrastructure projects and analyse large datasets. Imagine revolutionizing the way we build our cities and infrastructure, all by the power of coding. Programming skills are no longer just a bonus—they’re a game changer in this era. Technology is revolutionizing civil engineering by integrating advanced tools and techniques. Programming allows for the automation of repetitive tasks, enhancing the accuracy of designs, simulations, and analyses. With the advent of artificial intelligence and machine learning, engineers can now predict structural behaviors under various conditions, optimize material usage, and improve project planning.

Implementations of Fused Deposition Modeling in real world

Emerging Tech

The presentation showcases the diverse real-world applications of Fused Deposition Modeling (FDM) across multiple industries: 1. **Manufacturing**: FDM is utilized in manufacturing for rapid prototyping, creating custom tools and fixtures, and producing functional end-use parts. Companies leverage its cost-effectiveness and flexibility to streamline production processes. 2. **Medical**: In the medical field, FDM is used to create patient-specific anatomical models, surgical guides, and prosthetics. Its ability to produce precise and biocompatible parts supports advancements in personalized healthcare solutions. 3. **Education**: FDM plays a crucial role in education by enabling students to learn about design and engineering through hands-on 3D printing projects. It promotes innovation and practical skill development in STEM disciplines. 4. **Science**: Researchers use FDM to prototype equipment for scientific experiments, build custom laboratory tools, and create models for visualization and testing purposes. It facilitates rapid iteration and customization in scientific endeavors. 5. **Automotive**: Automotive manufacturers employ FDM for prototyping vehicle components, tooling for assembly lines, and customized parts. It speeds up the design validation process and enhances efficiency in automotive engineering. 6. **Consumer Electronics**: FDM is utilized in consumer electronics for designing and prototyping product enclosures, casings, and internal components. It enables rapid iteration and customization to meet evolving consumer demands. 7. **Robotics**: Robotics engineers leverage FDM to prototype robot parts, create lightweight and durable components, and customize robot designs for specific applications. It supports innovation and optimization in robotic systems. 8. **Aerospace**: In aerospace, FDM is used to manufacture lightweight parts, complex geometries, and prototypes of aircraft components. It contributes to cost reduction, faster production cycles, and weight savings in aerospace engineering. 9. **Architecture**: Architects utilize FDM for creating detailed architectural models, prototypes of building components, and intricate designs. It aids in visualizing concepts, testing structural integrity, and communicating design ideas effectively. Each industry example demonstrates how FDM enhances innovation, accelerates product development, and addresses specific challenges through advanced manufacturing capabilities.

How RPA Help in the Transportation and Logistics Industry.pptx

SynapseIndia

Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy

RaminGhanbari2

TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In

TrustArc

Six months into 2024, and it is clear the privacy ecosystem takes no days off!! Regulators continue to implement and enforce new regulations, businesses strive to meet requirements, and technology advances like AI have privacy professionals scratching their heads about managing risk. What can we learn about the first six months of data privacy trends and events in 2024? How should this inform your privacy program management for the rest of the year? Join TrustArc, Goodwin, and Snyk privacy experts as they discuss the changes we’ve seen in the first half of 2024 and gain insight into the concrete, actionable steps you can take to up-level your privacy program in the second half of the year. This webinar will review: - Key changes to privacy regulations in 2024 - Key themes in privacy and data governance in 2024 - How to maximize your privacy program in the second half of 2024

What's New in Copilot for Microsoft365 May 2024.pptx

Stephanie Beckett

INDIAN AIR FORCE FIGHTER PLANES LIST.pdf

jackson110191

The Increasing Use of the National Research Platform by the CSU Campuses

Larry Smarr

Comparison Table of DiskWarrior Alternatives.pdf

Andrey Yasko

Recent Advancements in the NIST-JARVIS Infrastructure

KAMAL CHOUDHARY

Mitigating the Impact of State Management in Cloud Stream Processing Systems

ScyllaDB

Stream processing is a crucial component of modern data infrastructure, but constructing an efficient and scalable stream processing system can be challenging. Decoupling compute and storage architecture has emerged as an effective solution to these challenges, but it can introduce high latency issues, especially when dealing with complex continuous queries that necessitate managing extra-large internal states. In this talk, we focus on addressing the high latency issues associated with S3 storage in stream processing systems that employ a decoupled compute and storage architecture. We delve into the root causes of latency in this context and explore various techniques to minimize the impact of S3 latency on stream processing performance. Our proposed approach is to implement a tiered storage mechanism that leverages a blend of high-performance and low-cost storage tiers to reduce data movement between the compute and storage layers while maintaining efficient processing. Throughout the talk, we will present experimental results that demonstrate the effectiveness of our approach in mitigating the impact of S3 latency on stream processing. By the end of the talk, attendees will have gained insights into how to optimize their stream processing systems for reduced latency and improved cost-efficiency.

Transcript: Details of description part II: Describing images in practice - T...

BookNet Canada

This presentation explores the practical application of image description techniques. Familiar guidelines will be demonstrated in practice, and descriptions will be developed “live”! If you have learned a lot about the theory of image description techniques but want to feel more confident putting them into practice, this is the presentation for you. There will be useful, actionable information for everyone, whether you are working with authors, colleagues, alone, or leveraging AI as a collaborator. Link to presentation recording and slides: https://bnctechforum.ca/sessions/details-of-description-part-ii-describing-images-in-practice/ Presented by BookNet Canada on June 25, 2024, with support from the Department of Canadian Heritage.

Calgary MuleSoft Meetup APM and IDP .pptx

ishalveerrandhawa1

The Rise of Supernetwork Data Intensive Computing

Larry Smarr

Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops

Mydbops

This presentation, delivered at the Postgres Bangalore (PGBLR) Meetup-2 on June 29th, 2024, dives deep into connection pooling for PostgreSQL databases. Aakash M, a PostgreSQL Tech Lead at Mydbops, explores the challenges of managing numerous connections and explains how connection pooling optimizes performance and resource utilization. Key Takeaways: * Understand why connection pooling is essential for high-traffic applications * Explore various connection poolers available for PostgreSQL, including pgbouncer * Learn the configuration options and functionalities of pgbouncer * Discover best practices for monitoring and troubleshooting connection pooling setups * Gain insights into real-world use cases and considerations for production environments This presentation is ideal for: * Database administrators (DBAs) * Developers working with PostgreSQL * DevOps engineers * Anyone interested in optimizing PostgreSQL performance Contact info@mydbops.com for PostgreSQL Managed, Consulting and Remote DBA Services

Advanced Techniques for Cyber Security Analysis and Anomaly Detection

Bert Blevins

Cybersecurity is a major concern in today's connected digital world. Threats to organizations are constantly evolving and have the potential to compromise sensitive information, disrupt operations, and lead to significant financial losses. Traditional cybersecurity techniques often fall short against modern attackers. Therefore, advanced techniques for cyber security analysis and anomaly detection are essential for protecting digital assets. This blog explores these cutting-edge methods, providing a comprehensive overview of their application and importance.

Quality Patents: Patents That Stand the Test of Time

Aurora Consulting

Is your patent a vanity piece of paper for your office wall? Or is it a reliable, defendable, assertable, property right? The difference is often quality. Is your patent simply a transactional cost and a large pile of legal bills for your startup? Or is it a leverageable asset worthy of attracting precious investment dollars, worth its cost in multiples of valuation? The difference is often quality. Is your patent application only good enough to get through the examination process? Or has it been crafted to stand the tests of time and varied audiences if you later need to assert that document against an infringer, find yourself litigating with it in an Article 3 Court at the hands of a judge and jury, God forbid, end up having to defend its validity at the PTAB, or even needing to use it to block pirated imports at the International Trade Commission? The difference is often quality. Quality will be our focus for a good chunk of the remainder of this season. What goes into a quality patent, and where possible, how do you get it without breaking the bank? ** Episode Overview ** In this first episode of our quality series, Kristen Hansen and the panel discuss: ⦿ What do we mean when we say patent quality? ⦿ Why is patent quality important? ⦿ How to balance quality and budget ⦿ The importance of searching, continuations, and draftsperson domain expertise ⦿ Very practical tips, tricks, examples, and Kristen’s Musts for drafting quality applications https://www.aurorapatents.com/patently-strategic-podcast.html

find out more about the role of autonomous vehicles in facing global challenges

huseindihon

Cookies program to display the information though cookie creation

shanthidl1

How Social Media Hackers Help You to See Your Wife's Message.pdf

HackersList

Recently uploaded (20)

Best Programming Language for Civil Engineers

Implementations of Fused Deposition Modeling in real world

How RPA Help in the Transportation and Logistics Industry.pptx

Active Inference is a veryyyyyyyyyyyyyyyyyyyyyyyy

TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-In

What's New in Copilot for Microsoft365 May 2024.pptx

INDIAN AIR FORCE FIGHTER PLANES LIST.pdf

The Increasing Use of the National Research Platform by the CSU Campuses

Comparison Table of DiskWarrior Alternatives.pdf

Recent Advancements in the NIST-JARVIS Infrastructure

Mitigating the Impact of State Management in Cloud Stream Processing Systems

Transcript: Details of description part II: Describing images in practice - T...

Calgary MuleSoft Meetup APM and IDP .pptx

The Rise of Supernetwork Data Intensive Computing

Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops

Advanced Techniques for Cyber Security Analysis and Anomaly Detection

Quality Patents: Patents That Stand the Test of Time

find out more about the role of autonomous vehicles in facing global challenges

Cookies program to display the information though cookie creation

How Social Media Hackers Help You to See Your Wife's Message.pdf

Build Your Own Recommendation Engine

1. Build Your Own Recommendation Engine   (during Weekend) Michal Malohlava @mmalohlava && @h2oai presents

2. MusicService Activities   clicks/swipes/likes Clients iOS/Android/… Next   N-recommendations ? REBB*? *REBB = Recommendation Engine Black Box

3. Requirements Activities   can be >100/s REBB should be accessible via REST API Recommendations need to be served <500ms, should keep users exploring AWS infrastructure Need to   be ready in 2 days!

4. Requirements • Recommendations should be served <500ms • ML part should allow quick prototyping & experimentation • Storage (online/ofﬂine) - user stats, histories, recommendations • Scalable • frontend receiving requests • backend solving ML • storage Need to   be ready in 2 days!

5. Engine Architecture Variation of λ-architecture… … with pluggable ML  backend

6. Engine Architecture Regular EC2 nodes

7. API Router REST API via Spray  Akka Actor accepting and ﬁltering: • user activities • recommendation requests Scalable via HAProxy

8. API Router   Akka Actor handles  • POST of user activity   • publish activity to Redis  • update stats in Redis (quick updates)  • trigger recommendation computation

9. API Router   Akka Actor handles • GET recommendation request  • fetch pre-computed recommendation from Redis if exists  • OR try to do best-effort to provide “coldstart" recommendation based on history of user activities 

10. Redis Store Redis is used as  • events bus: • inform subscribers about user activities • requests to provide new recommendation for user  • data storage • old/new recommendations • statistics (likes/swipe per user) • simple persistence model  • computation engine • keep top-N artists, top-N songs per user

11. ML Backend Language/technology agnostic • Needs to be ﬂexible enough to prototype different strategies “Runners” for • generating recommendations  with H2O and Python • collecting/generating statistics • clustering users with H2O JVM “Runners” are subscribed to Redis/ processing Redis data

12. ML Backend Final strategy • identify user cluster based on   users activities (aka music styles) • apply different recommendation  strategies inside each cluster • identify “weird” users (~outliers) • adapt recommendation for them • needs manual intervention/algorithm tuning

13. Results • Single machine for API Router and Redis • peeks 50 activities/sec, avg 10 activities/sec • small memory footprint • ML Runners spread over EC2 machines • even simple but different strategies for each user sectors and selected individual users provides surprisingly good results

14. Learn more at h2o.ai Follow us at @h2oai Thank you!

Build Your Own Recommendation Engine

Related slideshows

Recommended for you

Recommended for you

Recommended for you

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (9)

Similar to Build Your Own Recommendation Engine

Similar to Build Your Own Recommendation Engine (20)

More from Sri Ambati

More from Sri Ambati (20)

Recently uploaded

Recently uploaded (20)

Build Your Own Recommendation Engine