Yiou X.

Sunnyvale, California, United States Contact Info
938 followers 500+ connections

Join to view profile

About

At Kumo.AI, my focus is on harnessing the power of graph neural networks to unravel…

Activity

Join now to see all activity

Experience & Education

  • Kumo.AI

View Yiou’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Licenses & Certifications

Publications

  • Prediction of biological functions by histone modification patterns profiling

    Bicob

    Histone modifications provide an important layer of gene regulation in eukaryotes. In this paper, we propose an approach that identifies the histone modification patterns most relevant for specific biological functions, such as flowering in plants. We first propose a new pattern scoring method, which evaluates the importance of each combinatorial pattern of histone modifications; this is used along with logistic regression, Support Vector Machines, and naive Bayesian classifier algorithms to…

    Histone modifications provide an important layer of gene regulation in eukaryotes. In this paper, we propose an approach that identifies the histone modification patterns most relevant for specific biological functions, such as flowering in plants. We first propose a new pattern scoring method, which evaluates the importance of each combinatorial pattern of histone modifications; this is used along with logistic regression, Support Vector Machines, and naive Bayesian classifier algorithms to predict gene functions. This approach is shown to be successful in inferring significant patterns verified by independent gene function data, outperforming other pattern scores used in current histone modification analysis research.

    See publication
  • Efficient Classification of Binary Data Stream with Concept Drifting Using Conjunction Rule based Boolean Classifier

    IEA-AIE/Springer

    We propose a conjunction rule based classification technique that has good classification performance, is simple, automatically identifies important attributes, and is extremely fast. Due to these properties the classifier is most suitable for “big” streaming data. Empirical study, using multiple datasets, shows that time complexity, compared with other classifiers, is faster by several factors, especially for large number of attributes without sacrificing performance.

    See publication
  • A fast sorting algorithm for aptamer identification using deep sequencing

    ASONAM/IEEE

    Abstract:
    In recent years, with the advent of fast sequencing technology, the genomic database is growing rapidly. Researchers in the bioinformatics field are expecting faster and more accurate tools to effectively analyze the gigantic data sets. In the context of aptamer search, the goal is to search for the over-represented DNA sequences from the randomly generated aptamer libraries. Hash functions are widely used in substring comparison, sequence alignment and clustering tools. We have…

    Abstract:
    In recent years, with the advent of fast sequencing technology, the genomic database is growing rapidly. Researchers in the bioinformatics field are expecting faster and more accurate tools to effectively analyze the gigantic data sets. In the context of aptamer search, the goal is to search for the over-represented DNA sequences from the randomly generated aptamer libraries. Hash functions are widely used in substring comparison, sequence alignment and clustering tools. We have developed a light-weight tool that takes advantage of the hash functions to reduce the size of genomic data and conducts η-neighbor searches on the centroid sequence. This greatly improves the efficiency of the search compared with existing tools. Furthermore, the prior calculation of hash values of η-neighbors decreases the searching overhead. In a dataset of 2.23 million sequences, the proposed algorithm accurately count the frequency of the Human α-Thrombin aptamer sequences in less than 40 seconds, whereas the current script-based method takes 2 hours and 18 minutes.

    See publication
  • Utilizing cis-element to refine gene regulatory network

    BIBM/IEEE

    Gene regulatory networks (GRNs) describe epistatic relationship of genes and how the expression of some genes influence the expression of other genes. This information is critical for understanding molecular mechanisms regulating various biological processes and molecular basis of several diseases. Current research work mostly attempts to infer such regulatory relationships (and GRN architecture topology) from gene expression data. This paper improves on this methodology by utilizing additional…

    Gene regulatory networks (GRNs) describe epistatic relationship of genes and how the expression of some genes influence the expression of other genes. This information is critical for understanding molecular mechanisms regulating various biological processes and molecular basis of several diseases. Current research work mostly attempts to infer such regulatory relationships (and GRN architecture topology) from gene expression data. This paper improves on this methodology by utilizing additional information available that describes which cis-elements are present in which genes, and at what locations. Using the underlying principle that target genes of a transcription factor should share the same binding site in their promoter regions, we propose a scoring method that facilitates the refinement of a candidate GRN. Improvements are demonstrated with three data sets, on which GRNs are first obtained from existing dataset (AtRegNet) or using an existing approach (ARACNe), and then modified using cis-element information.

    See publication

Projects

  • Meowth

    Developed a cloud-based smart litter box which monitors cat activities
    and body temperatures. – Python, Anaren atmosphere, Google Engine

    See project
  • Hit Stone

    -

    Hit Stone – a data visualization server for sequential and binary combinatorial patterns – Nodejs, RESTfull Sever, Flask, Python

    See project
  • Catworks

    -

    categorical attributes similarity learning: Given the relationship among attributes from different groups, we proposed an iterative attributes similarity learning using modified KL divergence.
    Performance is significantly better than BAM, and Binary Relevance + various classifiers.

  • SAXTIME

    -

    A combinatorial patterns recognition algorithm for timeseries
    data: Proposed a combinatorial shape-patterns recognition approach using
    wavelet transformation and symbolic aggregation approximation.
    (presented in SU Research Pitch Competition: 3rd Place Winner)

    See project
  • Bigwords

    -

    Amazon Alexa accepted skill: I developed a Alexa skill for people to learn
    and test synonyms in English. – Python, Amazon lambda server, Amazon E2C server

    See project
  • HiPSiS

    -

    Associate histone combinatorial patterns with gene functions: We create a hybrid classifier by creating a single convolutional layer of frequent item set to improve label prediction performance.
    (published on BICOB 17’)

    See project
  • aptamer hunter

    -

    A modulo operation based hashing method for nucleotide sequences
    for efficient indexing, counting and neighborhood search.
    (published on ASONAM 14’)

    See project
  • TFTS

    -

    Utilizing Cis-elements to Refine Gene Regulatory Network:
    I Incorporated sequence analysis into network link prediction and
    achieved higher performance. For each node in the network, we studied
    the distribution of target sequences in the network and compare
    with a global prior distribution to evaluate the significance. The algorithm
    iteratively update the edges in the network until convergence.
    (published on BIBM 13’)

  • NLP Course Project

    -

    A key word based boolean rule learning system for Chinese
    law documents which extracts some of the obvious rules of behaviors and potential
    consequences

  • Course Project - Webpage autonomous crawler

    -

    Taobao page extractor: Developed a html data extractor for website-wide price, review crawling using recurrent subtree structure detection. – Java

  • Efficient Classification of Binary Data Stream with Concept Drifting

    -

    Using Conjunction Rule Based Boolean Classifier:
    Proposed an efficient binary streaming data classifier with low memory
    footprint. For each dimension of streaming data, we create a confusion
    matrix and update the rules using binomial distribution analysis.
    Where the importance and inter-dependencies among dimensions
    are updated with incoming new data points which achieved similar
    performance compared with ILDA with high efficiency.
    (published on IEA-AIE 15’)

    See project

Honors & Awards

  • Syracuse University Poster Competition: Best Poster award in EECS

    Syracuse University

    http://eng-cs.syr.edu/news-events/news/student-innovation-recognized-at-2017-research-day/
    Yiou Xiao and Diksha Shukla tied for the best poster in the BMCE poster competition. Yiou Xiao’s poster explained his research on “Prediction of Biological Functions by Histone Modification Patterns Profiling” and Diksha Shukla’s poster focused on how “Your Smartphone Security is at Risk

  • Syracuse University Research Pitch Competition: 3rd place in Engineering School.

    Syracuse University

    http://eng-cs.syr.edu/news-events/news/student-innovation-recognized-at-2017-research-day/
    There was a tie for third place. Electrical Engineering and Computer Science (EECS) student Yiou Xiao G’11 won for his presentation on “Prediction of Biological Functions by Histone Modification Patterns Profiling.” Pranay Sharma, also an EECS student, won for his presentation on “Inferring Communication Network Topology via Transfer Entropy.”

  • Syracuse TeHack hackthon top prize (smart cat litter box)

    Syracuse On-center

    http://syracusecoe.syr.edu/techack-winners/

  • Syracuse University Fellow Award (2012- 2016)

    Syracuse University Graduate School

    I received the fellow grant for 4 years because of my excellence in academic performance and research progress.

  • Outstanding Graduate Student in CS (GPA top 1)

    Syracuse University

    Graduated with master degree and received the prize for excellent academic performance.

Languages

  • English

    Professional working proficiency

  • Chinese

    Native or bilingual proficiency

Organizations

  • IEEE

    no

    -

More activity by Yiou

View Yiou’s full profile

  • See who you know in common
  • Get introduced
  • Contact Yiou directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses