Sunnyvale, California, United States
Contact Info
938 followers
500+ connections
About
Activity
-
Our Kumo AI team is attending the International Conference on Machine Learning. Join us in Vienna, Austria, from July 21st to 27th. Find out more…
Our Kumo AI team is attending the International Conference on Machine Learning. Join us in Vienna, Austria, from July 21st to 27th. Find out more…
Liked by Yiou X.
-
It is heartwarming to see this level of excitement from our customers with the kind of results Kumo.AI delivers out of the box. Can't wait to add…
It is heartwarming to see this level of excitement from our customers with the kind of results Kumo.AI delivers out of the box. Can't wait to add…
Liked by Yiou X.
-
Dr. Sijia Liu, Assistant Professor of Computer Science and Engineering, has been awarded the NSF CAREER Award entitled Zeroth-Order Machine Learning:…
Dr. Sijia Liu, Assistant Professor of Computer Science and Engineering, has been awarded the NSF CAREER Award entitled Zeroth-Order Machine Learning:…
Liked by Yiou X.
Experience & Education
Licenses & Certifications
Publications
-
Prediction of biological functions by histone modification patterns profiling
Bicob
Histone modifications provide an important layer of gene regulation in eukaryotes. In this paper, we propose an approach that identifies the histone modification patterns most relevant for specific biological functions, such as flowering in plants. We first propose a new pattern scoring method, which evaluates the importance of each combinatorial pattern of histone modifications; this is used along with logistic regression, Support Vector Machines, and naive Bayesian classifier algorithms to…
Histone modifications provide an important layer of gene regulation in eukaryotes. In this paper, we propose an approach that identifies the histone modification patterns most relevant for specific biological functions, such as flowering in plants. We first propose a new pattern scoring method, which evaluates the importance of each combinatorial pattern of histone modifications; this is used along with logistic regression, Support Vector Machines, and naive Bayesian classifier algorithms to predict gene functions. This approach is shown to be successful in inferring significant patterns verified by independent gene function data, outperforming other pattern scores used in current histone modification analysis research.
-
Efficient Classification of Binary Data Stream with Concept Drifting Using Conjunction Rule based Boolean Classifier
IEA-AIE/Springer
We propose a conjunction rule based classification technique that has good classification performance, is simple, automatically identifies important attributes, and is extremely fast. Due to these properties the classifier is most suitable for “big” streaming data. Empirical study, using multiple datasets, shows that time complexity, compared with other classifiers, is faster by several factors, especially for large number of attributes without sacrificing performance.
-
A fast sorting algorithm for aptamer identification using deep sequencing
ASONAM/IEEE
Abstract:
In recent years, with the advent of fast sequencing technology, the genomic database is growing rapidly. Researchers in the bioinformatics field are expecting faster and more accurate tools to effectively analyze the gigantic data sets. In the context of aptamer search, the goal is to search for the over-represented DNA sequences from the randomly generated aptamer libraries. Hash functions are widely used in substring comparison, sequence alignment and clustering tools. We have…Abstract:
In recent years, with the advent of fast sequencing technology, the genomic database is growing rapidly. Researchers in the bioinformatics field are expecting faster and more accurate tools to effectively analyze the gigantic data sets. In the context of aptamer search, the goal is to search for the over-represented DNA sequences from the randomly generated aptamer libraries. Hash functions are widely used in substring comparison, sequence alignment and clustering tools. We have developed a light-weight tool that takes advantage of the hash functions to reduce the size of genomic data and conducts η-neighbor searches on the centroid sequence. This greatly improves the efficiency of the search compared with existing tools. Furthermore, the prior calculation of hash values of η-neighbors decreases the searching overhead. In a dataset of 2.23 million sequences, the proposed algorithm accurately count the frequency of the Human α-Thrombin aptamer sequences in less than 40 seconds, whereas the current script-based method takes 2 hours and 18 minutes. -
Utilizing cis-element to refine gene regulatory network
BIBM/IEEE
Gene regulatory networks (GRNs) describe epistatic relationship of genes and how the expression of some genes influence the expression of other genes. This information is critical for understanding molecular mechanisms regulating various biological processes and molecular basis of several diseases. Current research work mostly attempts to infer such regulatory relationships (and GRN architecture topology) from gene expression data. This paper improves on this methodology by utilizing additional…
Gene regulatory networks (GRNs) describe epistatic relationship of genes and how the expression of some genes influence the expression of other genes. This information is critical for understanding molecular mechanisms regulating various biological processes and molecular basis of several diseases. Current research work mostly attempts to infer such regulatory relationships (and GRN architecture topology) from gene expression data. This paper improves on this methodology by utilizing additional information available that describes which cis-elements are present in which genes, and at what locations. Using the underlying principle that target genes of a transcription factor should share the same binding site in their promoter regions, we propose a scoring method that facilitates the refinement of a candidate GRN. Improvements are demonstrated with three data sets, on which GRNs are first obtained from existing dataset (AtRegNet) or using an existing approach (ARACNe), and then modified using cis-element information.
Projects
-
Meowth
Developed a cloud-based smart litter box which monitors cat activities
and body temperatures. – Python, Anaren atmosphere, Google Engine -
Hit Stone
-
Hit Stone – a data visualization server for sequential and binary combinatorial patterns – Nodejs, RESTfull Sever, Flask, Python
-
Catworks
-
categorical attributes similarity learning: Given the relationship among attributes from different groups, we proposed an iterative attributes similarity learning using modified KL divergence.
Performance is significantly better than BAM, and Binary Relevance + various classifiers. -
SAXTIME
-
A combinatorial patterns recognition algorithm for timeseries
data: Proposed a combinatorial shape-patterns recognition approach using
wavelet transformation and symbolic aggregation approximation.
(presented in SU Research Pitch Competition: 3rd Place Winner) -
Bigwords
-
Amazon Alexa accepted skill: I developed a Alexa skill for people to learn
and test synonyms in English. – Python, Amazon lambda server, Amazon E2C server -
HiPSiS
-
Associate histone combinatorial patterns with gene functions: We create a hybrid classifier by creating a single convolutional layer of frequent item set to improve label prediction performance.
(published on BICOB 17’) -
aptamer hunter
-
A modulo operation based hashing method for nucleotide sequences
for efficient indexing, counting and neighborhood search.
(published on ASONAM 14’) -
TFTS
-
Utilizing Cis-elements to Refine Gene Regulatory Network:
I Incorporated sequence analysis into network link prediction and
achieved higher performance. For each node in the network, we studied
the distribution of target sequences in the network and compare
with a global prior distribution to evaluate the significance. The algorithm
iteratively update the edges in the network until convergence.
(published on BIBM 13’) -
NLP Course Project
-
A key word based boolean rule learning system for Chinese
law documents which extracts some of the obvious rules of behaviors and potential
consequences -
Course Project - Webpage autonomous crawler
-
Taobao page extractor: Developed a html data extractor for website-wide price, review crawling using recurrent subtree structure detection. – Java
-
Efficient Classification of Binary Data Stream with Concept Drifting
-
Using Conjunction Rule Based Boolean Classifier:
Proposed an efficient binary streaming data classifier with low memory
footprint. For each dimension of streaming data, we create a confusion
matrix and update the rules using binomial distribution analysis.
Where the importance and inter-dependencies among dimensions
are updated with incoming new data points which achieved similar
performance compared with ILDA with high efficiency.
(published on IEA-AIE 15’)
Honors & Awards
-
Syracuse University Poster Competition: Best Poster award in EECS
Syracuse University
http://eng-cs.syr.edu/news-events/news/student-innovation-recognized-at-2017-research-day/
Yiou Xiao and Diksha Shukla tied for the best poster in the BMCE poster competition. Yiou Xiao’s poster explained his research on “Prediction of Biological Functions by Histone Modification Patterns Profiling” and Diksha Shukla’s poster focused on how “Your Smartphone Security is at Risk -
Syracuse University Research Pitch Competition: 3rd place in Engineering School.
Syracuse University
http://eng-cs.syr.edu/news-events/news/student-innovation-recognized-at-2017-research-day/
There was a tie for third place. Electrical Engineering and Computer Science (EECS) student Yiou Xiao G’11 won for his presentation on “Prediction of Biological Functions by Histone Modification Patterns Profiling.” Pranay Sharma, also an EECS student, won for his presentation on “Inferring Communication Network Topology via Transfer Entropy.” -
Syracuse TeHack hackthon top prize (smart cat litter box)
Syracuse On-center
http://syracusecoe.syr.edu/techack-winners/
-
Syracuse University Fellow Award (2012- 2016)
Syracuse University Graduate School
I received the fellow grant for 4 years because of my excellence in academic performance and research progress.
-
Outstanding Graduate Student in CS (GPA top 1)
Syracuse University
Graduated with master degree and received the prize for excellent academic performance.
Languages
-
English
Professional working proficiency
-
Chinese
Native or bilingual proficiency
Organizations
-
IEEE
no
-
More activity by Yiou
-
Meet Nasir.AI! 🏆 Overall 2nd place of the LLM x Law Hackathon 🏆 Winners of the CodeX, The Stanford Center for Legal Informatics prize Members:…
Meet Nasir.AI! 🏆 Overall 2nd place of the LLM x Law Hackathon 🏆 Winners of the CodeX, The Stanford Center for Legal Informatics prize Members:…
Liked by Yiou X.
-
🐼 . . . . . Original seen on /ProgrammingHumor on Reddit.
🐼 . . . . . Original seen on /ProgrammingHumor on Reddit.
Liked by Yiou X.
-
Join us in celebrating a legend! Electrical Engineering and Computer Science Professor Shiu-Kai Chin is retiring after an incredible career here at…
Join us in celebrating a legend! Electrical Engineering and Computer Science Professor Shiu-Kai Chin is retiring after an incredible career here at…
Liked by Yiou X.
-
I have some bittersweet news to share today: today will be my last day at Kumo.AI. I'm so grateful for an incredible experience over the last 8…
I have some bittersweet news to share today: today will be my last day at Kumo.AI. I'm so grateful for an incredible experience over the last 8…
Liked by Yiou X.
-
It was great meeting with Hema Raghavan and David Mohr at the Kumo.AI booth at the Databricks Summit! The best part was that they were very friendly…
It was great meeting with Hema Raghavan and David Mohr at the Kumo.AI booth at the Databricks Summit! The best part was that they were very friendly…
Liked by Yiou X.
-
Following our recent announcement of Kumo as a Native app in Snowflake, our technical blog post is now out! We talk about how we built Kumo's…
Following our recent announcement of Kumo as a Native app in Snowflake, our technical blog post is now out! We talk about how we built Kumo's…
Liked by Yiou X.
-
Kumo.AI is excited to be a Snow Row partner! Come find out more at our booth at the #DataCloudSummit next week!
Kumo.AI is excited to be a Snow Row partner! Come find out more at our booth at the #DataCloudSummit next week!
Liked by Yiou X.
-
I had an amazing weekend at the Illinois State Special Olympics with my son in Normal, IL. Leo brought home a gold medal in the 100m freestyle and a…
I had an amazing weekend at the Illinois State Special Olympics with my son in Normal, IL. Leo brought home a gold medal in the 100m freestyle and a…
Liked by Yiou X.
-
🌟 Exciting News! 🌟 Kumo is now a native app in Snowflake Marketplace! 🚀 Unlock the power of AI-driven ML models with Kumo, combining graph…
🌟 Exciting News! 🌟 Kumo is now a native app in Snowflake Marketplace! 🚀 Unlock the power of AI-driven ML models with Kumo, combining graph…
Liked by Yiou X.
-
A day in life as a salesperson #snowflake #snowflakedatasummit
A day in life as a salesperson #snowflake #snowflakedatasummit
Liked by Yiou X.
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More