Lexical & Query
Semantics Differences
for Information
Retrieval
Why PageRank is Sometimes Better
for Semantics
Closing the Gap between Search
Query Language and Document
Language
• There are three components of Information
Retrieval Systems.
• Query Understanding
• Document-Query Relevance
Understanding
• Document Clustering and Ranking
• The path from a “search query” to a “search
document” involves query parsing, processing,
augmenting, scoring, ranking and clustering.
• Query Understanding is where the SEO starts.
• Document Creation is where the SEO continues.
• Document Ranking is where the SEO repeats itself.
Source: Query Language Determination Using Query Terms and Interface Language
What is this Search
Query Language?
• The Search Query Language was invented in the
Cranfield Experiments in the late 1950s.
• Scientists realized that while “querying a
document”, the language gets densified, and
words change their meaning.
• There is a huge vocabulary difference between
“queries” and “documents”.
• Because people do not know what to ask a
search engine, they only know what
represents the topic.
• The “query language” uses “knowledge
representation” with “dense vectors”.
• Query Term Weight Calculation was born during
these experiments.
Source: Augmenting Queries With Synonyms From Synonyms Map
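The slides do not say how query term weights were calculated. As a minimal sketch, assuming a smoothed inverse document frequency (IDF) as a stand-in weighting, rarer query terms receive higher weights than common ones. The function name `idf_weights` and the sample documents are hypothetical.

```python
import math

def idf_weights(query, documents):
    """Weight each query term by a smoothed inverse document frequency.

    This is an illustrative stand-in, not the Cranfield method.
    """
    n_docs = len(documents)
    tokenized = [set(doc.lower().split()) for doc in documents]
    weights = {}
    for term in query.lower().split():
        # Number of documents that contain the term.
        df = sum(1 for tokens in tokenized if term in tokens)
        # Add-one smoothing keeps the weight finite for unseen terms.
        weights[term] = math.log((n_docs + 1) / (df + 1)) + 1.0
    return weights

# Hypothetical aerospace-style mini collection.
docs = [
    "the wing design of a supersonic aircraft",
    "the heat transfer on a wing surface",
    "the boundary layer of a flat plate",
]
weights = idf_weights("supersonic wing", docs)
```

Here “supersonic” appears in one document and “wing” in two, so “supersonic” ends up with the larger weight.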
Query Search
Language
• Cranfield Experiments: Led by Cyril W. Cleverdon, they are among the first
Information Retrieval experiments.
• They tested the efficiency of indexing systems.
• Vannevar Bush’s “As We May Think” paper is cited in the
research.
• The Cranfield Experiments invented the “Search Language” concept
to acknowledge that words change their meanings inside
search queries even if they are used the same way inside the document.
• Information Retrieval has to make a distinction between
“understanding relevance” and “understanding query”.
• To understand the query, a search engine can’t use the language
model it uses for understanding the documents.
• Document language and query language are completely different.
• Inside the documents, we see “lexical semantics”.
• Inside the queries, we see “query semantics” with “search
language”.
Source: “As We May Think” – Vannevar Bush; Cranfield Experiments, Cyril W. Cleverdon, 1958.

An Algorithm doesn’t have
to be liked by your logic
• An algorithm doesn’t have to make sense.
• An algorithm has to be useful.
• The Cranfield Experiments have been debated for decades, and
new research still cites them.
• The Cranfield Experiments do not explain why their method
works; they just show that it works.
• The experiments tell test subjects to take documents
from the “aerospace” topic, and write some “keywords”, or
“search queries” for that topic.
• Test subjects rank the documents based on their own
query terms and their own judgement.
• The Cranfield Experiments created the concepts of
“search language” and “document language” along with
“natural language query”.
Source: Query Generation Using Structural Similarity Between Documents
Lexical Semantics
• Lexicosemantics involves word-sense
disambiguation with word compositionality and
the language syntax-semantics interface.
• Lexicosemantics helps Formal Semantics (Natural
Language).
• Formal Semantics studies the grammatical meaning of
natural language with theoretical computer
science.
• Lexical Semantics helps with the construction of
WordNets, FrameNets, Knowledge Bases and
Index Tiers.
• Lexical Semantics is useful for Search Engines to
process a text item to understand the “Semantic
Scope” of sentences with “modality”, “tense”,
“binding”, “aspect”, and pragmatics.
• Lexical semantics involves hyponymy, hypernymy,
antonymy, homonymy, polysemy, meronymy,
holonymy and semantic networks.
Source: Query Generation Using Structural Similarity Between Documents
Do You Remember Google Merge?
• What if Google became a
semantic search engine by buying
another one?
• Oingo was the first search engine
focused on meaning-based
relevance and advertisement.
• They became “Applied Semantics”
in 2001.
• Google and Applied Semantics
merged together on 18 April,
2003.
Applied Semantics (Oingo): The First
Conceptual Search Engine
• Applied Semantics was created by Eytan Elbaz in 1999.
• Information Extraction and Information Responsiveness work
differently than Information Retrieval.
• Lexical Relations do not carry the meaning in query terms, but
Query Semantics do. Thus, query semantics were used for the first time
to augment and expand a query.
• It is one of the first designs that mention “semantic distance”
and “relationship strength” to create a semantic network of
concepts.
• It paved the way for “Index Tiering”.
Typically, search engines match the search terms to the documents as a whole. If the user is interested in
specific information, for example, “sharks”, but a particular document about “beaches around the
world”, for example, only has one sentence about sharks, it is unlikely that the search engine would return the
document. Documents like the one described are likely to score very low under the query for “sharks”, if at all,
because the document as a whole is not “about” sharks.
Source: Methods and systems for detecting and extracting information
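The “sharks” example can be illustrated with a small sketch that compares a whole-document score with the score of the best single sentence. Term density is used here only as a stand-in relevance score; it is an assumption for illustration, not the method of the cited patent. The helper names and the sample text are hypothetical.

```python
import re

def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())

def term_density(term, text):
    """Share of tokens in the text that equal the term."""
    tokens = tokenize(text)
    return tokens.count(term) / len(tokens) if tokens else 0.0

def best_passage(term, document):
    """Return the sentence with the highest density of the term."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    return max(sentences, key=lambda s: term_density(term, s))

# Hypothetical document about beaches with one sentence about sharks.
document = (
    "Beaches around the world attract millions of visitors. "
    "Some beaches are famous for white sand and calm water. "
    "Sharks are sometimes seen near a few of these beaches. "
    "Most visitors come for the sun and the surf."
)
doc_score = term_density("sharks", document)
passage = best_passage("sharks", document)
passage_score = term_density("sharks", passage)
```

The document as a whole scores low for “sharks”, while the single sentence about sharks scores much higher, which is the gap that passage-level extraction tries to close.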

Do You Remember Google Merge?
• Similarity (“gluttonous” is similar to “greedy”) – Near Synonyms
• Membership (“commissioner” is a member of “commission”)
• Meronymy (whole/part relations) (“motor vehicle” has part
“clutch pedal”)
• Substance (e.g. “lumber” has substance “wood”)
• Product (e.g. “Microsoft Corporation” produces “Microsoft
Access”)
• Attribute (“past”, “preceding” are attributes of “timing”)
• Causation (e.g. travel causes displacement/motion)
• Entailment (e.g. buying entails paying)
• Lateral bonds (concepts closely related to one another, but not
in one of the other relationships, e.g. “dog” and “dog collar”)
• Capitonyms (Polish (nation), polish (shining))
• Troponym (Walking -> Hustle, Trot, Crawl)
• Eponym (Tommy John Surgery, Biswanath Panda -> Panda Update)
• Demonym (New Yorkers -> Population of New York City, not
State)
• Acronyms (NASA, North American Saxophone Alliance,
National Auto Sport Association, National Association of
Students of Architecture)
Source: Bill Slawski
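Relation types like these can be stored as labelled, weighted edges in a semantic network. The sketch below assumes that “relationship strength” is a number between 0 and 1 and that “semantic distance” is the cheapest path when each edge costs `-log(strength)`. Both choices, all strength values, and the “dog”/“pet”/“cat” edges are assumptions for illustration and do not come from the Applied Semantics design.

```python
import heapq
import math

# (concept, concept, relation type, strength in (0, 1]); strengths are made up.
EDGES = [
    ("gluttonous", "greedy", "similarity", 0.9),
    ("motor vehicle", "clutch pedal", "meronymy", 0.6),
    ("lumber", "wood", "substance", 0.8),
    ("dog", "dog collar", "lateral bond", 0.5),
    ("dog", "pet", "hypernymy", 0.8),  # hypothetical edge, not on the slide
    ("pet", "cat", "hypernymy", 0.8),  # hypothetical edge, not on the slide
]

def build_graph(edges):
    """Build an undirected graph where stronger relations cost less."""
    graph = {}
    for a, b, _relation, strength in edges:
        cost = -math.log(strength)
        graph.setdefault(a, []).append((b, cost))
        graph.setdefault(b, []).append((a, cost))
    return graph

def semantic_distance(graph, source, target):
    """Cheapest path cost between two concepts (Dijkstra's algorithm)."""
    best = {source: 0.0}
    queue = [(0.0, source)]
    while queue:
        dist, node = heapq.heappop(queue)
        if node == target:
            return dist
        if dist > best.get(node, math.inf):
            continue  # stale queue entry
        for neighbor, cost in graph.get(node, []):
            candidate = dist + cost
            if candidate < best.get(neighbor, math.inf):
                best[neighbor] = candidate
                heapq.heappush(queue, (candidate, neighbor))
    return math.inf  # the concepts are not connected

graph = build_graph(EDGES)
near = semantic_distance(graph, "gluttonous", "greedy")
far = semantic_distance(graph, "dog collar", "cat")
unrelated = semantic_distance(graph, "dog", "lumber")
```

With these made-up strengths, near synonyms sit close together, “dog collar” reaches “cat” only through “dog” and “pet”, and concepts with no connecting path get an infinite distance.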
Formal Semantics
• Formal Semantics involves philosophy of language and
linguistics together.
• Denotations of natural language expressions are used to
understand the compositionality of words, and their
references.
• Nature of meaning is the philosophical part of formal
semantics.
• Nature of meaning involves the meanings that come from
our nature (Constructivist, Coherence, Correspondence,
Consensus, Pragmatic Theories).
• Formal Semantics has two approaches.
• Truth Conditions
• Compositionality
• Formal Semantics is related to Lexical Semantics, because
based on lexical relations, the compositionality, and truth
conditions change.
Formal Semantics and Inquisitive Semantics
• Inquisitive Semantics involves raising new but related
issues to a truth value.
• For example: “Aspirin is used against headache. Does it
work against toothache?”
• The “toothache” and “headache” here have a lexical
relation to each other as co-hyponyms (both are kinds of ache).
• The Formal Semantics here helps to understand the
truth value of “Aspirin” and its functions.
• The Formal Semantics and Truth Conditions have two
approaches.
• Dynamic Semantics: The raised issues have to change the
context, and the first premise has to be correct.
• Static Semantics: The raised issue doesn’t have to be
relevant, and premise doesn’t have to be true.
• For example: “John gives SEO Suggestions as a Googler.
Does John give useful SEO Suggestions as a Googler?”
• Technically, John’s occupation is not connected to the
suggestions’ usefulness.
• The Dynamic Semantics change the context of the
previous sentence based on interpreter and receiver.
• Multi-stage or chained reasoning is highly relevant to
the Dynamic Semantics for “context direction”.
Source: Multi-level Recommendation Reasoning over Knowledge Graphs with
Reinforcement Learning
Formal Semantics and Compositionality
• Compositionality is to understand
lexical relations between the
subjects and objects.
• The easiest way to have a formal
semantics understanding for
compositionality is removing all the
meaningful lexical units from the
sentence.
• For the sentence “Contadu is the
best technology for creating a
semantical understanding to
optimize content”.
• “C is t-t for s-u to o-c”.
• The structure here gives the composition
of words, and how lexical relations are
constructed with constituent rules.
Source: Compositionality by Henk J. Verkuyl, Utrecht University

Formal Semantics and Scope
• Scope determines the validity of the specific
declaration’s range.
• Formal semantics helps machines to process
the human language for understanding the
specific scope.
• For example:
• “Every student has a favourite teacher”. -> It is not
clear whether every student has the same teacher
as their favourite or, all of them have different
teachers as their favourite, or some of them have
same teacher, and some of them have different
teachers as their favourite.
• “When three more votes are taken from the court,
the decision will be as we want.” -> What is unclear
here is why three, and which three. Does the
court have different layers of officials with
different vote values, or do specifically “X, Y, Z”
officials need to vote, and which other
decision-makers are against the decision that the
person wants? This is an example of Inquisitive
Semantics; use it for question generation.
• There are other types of scopes, such as “scope
islands” and “exceptional scopes”.
Source: Context-Sensitivity and Individual Differences in the Derivation of Scalar
Implicature
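The two main readings of “Every student has a favourite teacher” can be written as a worked formula. This is the standard textbook first-order rendering of quantifier scope ambiguity, added as an illustration; the predicate names are assumptions.

```latex
\begin{align*}
\text{Surface scope: } & \forall s\,\bigl(\mathrm{student}(s) \rightarrow \exists t\,(\mathrm{teacher}(t) \land \mathrm{favourite}(s,t))\bigr) \\
\text{Inverse scope: } & \exists t\,\bigl(\mathrm{teacher}(t) \land \forall s\,(\mathrm{student}(s) \rightarrow \mathrm{favourite}(s,t))\bigr)
\end{align*}
```

In the first reading each student may have a different favourite teacher; in the second, one teacher is the favourite of every student. The mixed case, where some students share a favourite and others do not, is compatible with the first reading only.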
Formal Semantics and Scope
• Scope islands are called islands because
a quantifier can’t take scope outside of
them.
• For example: “If every elephant in the
sanctuary gains 5 pounds every 6
months, I will get a promotion”. The
person doesn’t get another promotion
each time an elephant gains 5 pounds.
It happens once.
• Exceptional Scope reverses scope
islands with the indefinite “a”.
• For example, “If an elephant gains 5
pounds, I will get a promotion”. Here,
ambiguity and repetitiveness occur
together.
• Scope is important for Compositionality,
and Compositionality is important for
Lexical Semantics.
Source: Creation of inferred queries for use as query suggestions
Formal Semantics and Modality
• Modality is part of Formal Semantics
with propositional content, and
philosophical logic. There are
different modalities:
• Permissible: Expresses the acts that are
allowed.
• Possible: Expresses the acts that are
possible.
• Quintessential: Expresses the acts’
features.
• Evidential: Expresses the facts with a
factual source.
• Habitual: Expresses the habits.
• Iterative: Expresses the repeated acts.
• Frequentative: Expresses the permanent
facts.
Source: Semantic frame identification with distributed word representations
Formal Semantics and Binding
• Binding creates a bond between the predicate and the subject. Anaphors are used to express the
connections between bound predicates and subjects.
• Modality expresses the features of lexical relations, while binding expresses their direction.
• In the sentence “Nancy Pelosi must be the next presidential candidate for her career”, “must be” involves
“strong possibility” while “career” is bound to “Nancy Pelosi”.
• Set theory works here to create the set “People who must be the next candidates for the presidential election”, with
“being a presidential candidate” as a possible “political career improvement” act. “Presidential
candidate” becomes a topic that involves connections to other types of “candidacies”, while “political career
steps” and “political discussions” are connected to it.
• Binding and modality work together to create an Information Graph.
• If the sentence changes to “Nancy Pelosi is the best possible candidate for every Democrat in the US.”, the
sentence has a possibility from a different “modality”, and the concept of “scope” works here again.
• The declaration tells that “Nancy Pelosi is a candidate” for “every Democrat in the US”. This explains the “scope”
and “compositionality”.
• Compositionality here is “N is a c for e d in the U.S.”
• The main issue here is that the scope doesn’t make sense. If a Democrat goes outside of the US, does it mean
that “Nancy Pelosi is suddenly not the best candidate” anymore? Or is she literally the best candidate for every
Democrat?
• Thus, the scope here affects the “modality” further, and makes the “possibility” “opinionated” rather than a
“factual possibility”.
• The Formal Semantics components affect each other.
• The output of Formal Semantics affects Lexical Semantics.
• Lexical Semantics affects the Lexical Relations.
• Lexical Relations affect the Information Graph and Extraction.
• Information Extraction determines the Knowledge Base (Raw Knowledge Graph).
Source: Providing result-based query suggestions

Formal Semantics and T-A-M (Tense-Aspect-
Mood)
• Tense-aspect-mood has different combinations
to extract information, and relate
lexicosemantics to each other within a data
graph.
• Tense involves the position of the action inside the
timeline.
• Past, Present, Future
• Aspect involves the extension of the state of action in
the timeline.
• Unitary – Happened once and suddenly.
• Continuous – Happens during the time.
• Repeated – Happened repeatedly, will happen again.
• Mood (modality) involves the actuality of action.
• Possibility: Might happen.
• Necessity: Should happen.
Source: Extracting Semantic Classes from Text
Transition from Lexical Semantics to Query
Semantics
• Query Semantics and Lexical Semantics are
different from each other but highly similar.
• Words that are lexically unrelated, or even
opposites, might be relevant to each other in
Query Semantics.
• For example, “Buy” and “Sell” are opposites, or
antonyms of each other.
• In Query Semantics, “Buy” and “Sell” are
synonyms, in other words, they mean the same
thing.
• “Soft Drinks” is a different concept than
“Coca Cola”. “Soft Drinks” is a hypernym
of Coca Cola in Lexical Semantics, but in Query
Semantics, they might be synonyms.
Transition from Lexical Semantics to Query
Semantics
• Query Semantics is used for “Query
Inference”, and “Query Phrasification”.
• The Query “Best temperature for Soft
Drink” is a query for a hypernym in
Lexical Semantics.
• Query Semantics is used to generate
the same search query for other
members of the same set, because at
the same time, they are synonyms in
query semantics.
• “Soft drinks such as Coca Cola” and “Coca
Cola (Soft Drink)” don’t represent the
same thing in Query Semantics.
• The second phrase is more relevant to “Coca
Cola”, while the first one is more relevant
to the entire “class of things”.
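Generating the same query for other members of a set can be sketched as a simple substitution. The `CLASS_MEMBERS` table and the function name `sibling_queries` are hypothetical; a real system would presumably draw class membership from a knowledge base rather than a hard-coded dictionary.

```python
# Hypothetical class-to-member table; "pepsi" and "coca cola" come from the slides.
CLASS_MEMBERS = {"soft drink": ["coca cola", "pepsi"]}

def sibling_queries(query):
    """Rewrite a query about a class into one query per class member."""
    lowered = query.lower()
    for class_name, members in CLASS_MEMBERS.items():
        if class_name in lowered:
            return [lowered.replace(class_name, member) for member in members]
    return []  # no known class mentioned in the query

generated = sibling_queries("Best temperature for Soft Drink")
```

For the hypernym query above, this yields one query for “coca cola” and one for “pepsi”.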
Transition from Lexical Semantics to Query
Semantics
• “Best temperature for pepsi” query
requires further query processing
with lexicosemantics and query
semantics.
• “Best temperature for pepsi” has
a missing part.
• For drinking
• For serving
• For producing
• For storing
• For mixing
• All the possible “verbs” come from
“lexical semantics” and how they are
used in “query search” language.
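Filling in the missing part can be sketched as query expansion: each candidate activity is inserted before the entity. The two lookup tables and the function name `expand_query` are hypothetical. The activity list is the one on the slide.

```python
# Activities taken from the slide, keyed by a hypothetical entity class.
ACTIVITIES_BY_CLASS = {
    "soft drink": ["drinking", "serving", "producing", "storing", "mixing"],
}
# Hypothetical entity-to-class lookup.
ENTITY_CLASS = {"pepsi": "soft drink", "coca cola": "soft drink"}

def expand_query(query):
    """Insert each candidate activity before a known entity at the end of the query."""
    lowered = query.lower()
    for entity, entity_class in ENTITY_CLASS.items():
        if lowered.endswith("for " + entity):
            head = lowered[: -len(entity)]  # keeps the trailing "for "
            activities = ACTIVITIES_BY_CLASS[entity_class]
            return [f"{head}{activity} {entity}" for activity in activities]
    return [lowered]  # nothing to expand

expanded = expand_query("Best temperature for pepsi")
```

The result is five candidate queries, starting with “best temperature for drinking pepsi”; choosing among them would need usage data from the query language, which this sketch does not model.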

Formal Semantics and T-A-M (Tense-Aspect-
Mood)
• Formal Semantics and T-A-M affect lexical
semantics.
• The “tense”, “aspect” and “mood”
combinations create different lexical
relations with contexts.
Transition from Lexical Semantics to Query
Semantics
• The smallest query and word differences
can create ranking changes,
• even if the search intent is the same,
• or they mean the same thing.
Source: Compositionality by Henk J. Verkuyl, Utrecht University
what should happen to someone who has hemophilia
what can happen to someone who has hemophilia
what happens to someone who has hemophilia
Formal Semantics and T-A-M (Tense-Aspect-
Mood)
• The modality “should” represents a
responsibility, and a solution for a problem.
• Thus, the result focuses on “treatment” or
“precaution”, even if the rest of the sentence is
the same.
what should not happen to someone
who has hemophilia
what will not happen to someone who
has hemophilia
what happened to someone who has
hemophilia
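The query variants above differ only in modality, negation, and tense. A minimal sketch of pulling those three features out of a query with keyword lookups follows; the word lists and the labels are assumptions for illustration and cover only the example queries.

```python
import re

# Hypothetical keyword tables covering only the example queries.
MODALS = {"should": "necessity", "can": "possibility"}
PAST_FORMS = {"happened"}

def describe_query(query):
    """Extract crude modality, negation, and tense features from a query."""
    tokens = re.findall(r"[a-z]+", query.lower())
    modality = next((MODALS[t] for t in tokens if t in MODALS), "none")
    if "will" in tokens:
        tense = "future"
    elif any(t in PAST_FORMS for t in tokens):
        tense = "past"
    else:
        tense = "present"
    return {"modality": modality, "negated": "not" in tokens, "tense": tense}

features = [
    describe_query("what should not happen to someone who has hemophilia"),
    describe_query("what will not happen to someone who has hemophilia"),
    describe_query("what happened to someone who has hemophilia"),
]
```

Each query maps to a different feature combination even though the rest of the sentence is identical, which is one way to see why the results can differ.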
Formal Semantics and T-A-M (Tense-Aspect-
Mood)
• Lemmatization of forms such
as “effected” and
“effective” brings the answers
closer.
• The predicate “show” is
closer to “demonstrate”,
and “metrics”, or “tests”.
• The predicates and
possible
compositionalities have
different types of themes.
what shows happen to someone who has hemophilia
what effected to someone who has hemophilia

Query-Document
Vocabulary Gap


Query Semantics
• “Cat” and “dog” can behave as synonyms.
• “Part-time” and “full-time” can behave as synonyms.
• But sometimes they are not synonyms.
• For the query “find job”, “part-time” and “full-time” might be synonyms.
• For the query “buy pet”, “cat” and “dog” might be synonyms.
• But for the query “dog food”, they are not synonyms.
• “Sign in” and “sign on” might or might not be synonyms.
• “Address” might mean contact information, or just the postal address.
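The idea that two terms are synonyms only within a given query context can be sketched in a few lines. This is a minimal illustration, not how any search engine stores synonyms: the table `CONTEXT_SYNONYMS` and the function `are_synonyms` are hypothetical names, and the entries are hand-written from the examples above rather than learned from query logs.

```python
# Hypothetical, hand-written table: which terms may substitute for each other
# under which query context. A real system would learn this from data.
CONTEXT_SYNONYMS = {
    "find job": [{"part-time", "full-time"}],
    "buy pet": [{"cat", "dog"}],
    "dog food": [],  # "cat" and "dog" must stay distinct for this query
}


def are_synonyms(term_a, term_b, query_context):
    """Return True if the two terms are interchangeable for this query context."""
    a, b = term_a.lower(), term_b.lower()
    if a == b:
        return True
    # Unknown contexts have no synonym groups, so terms stay distinct.
    groups = CONTEXT_SYNONYMS.get(query_context.lower(), [])
    return any(a in group and b in group for group in groups)


if __name__ == "__main__":
    print(are_synonyms("cat", "dog", "buy pet"))
    print(are_synonyms("cat", "dog", "dog food"))
```

Keying the lookup on the query context, rather than on the word pair alone, is the point of the sketch: the same pair gets a different answer for “buy pet” and “dog food”.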
Query Semantics
• New York is not York.
• “York Hotels” doesn’t mean “New York Hotels”.
• But “Vegas” is always “Las Vegas”.
• If you search from Latin America, “York” is New York.
• If you search from Africa, “York” is still New York.
• If you search from France, it is 50/50.
• If you search from the UK, “York” is, again, not New York.
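The location dependence described above can be sketched as a lookup from the searcher's region to the likely reading of “York”. The probabilities below are illustrative only: they encode the rough claims on the slide (always, always, 50/50, never) and are not measured data. The names `P_YORK_MEANS_NEW_YORK` and `interpret_york` are hypothetical.

```python
from typing import List

# Illustrative probability that the bare term "york" means "New York",
# keyed by searcher region. Values mirror the slide's claims, not real data.
P_YORK_MEANS_NEW_YORK = {
    "latin_america": 1.0,
    "africa": 1.0,
    "france": 0.5,
    "uk": 0.0,
}


def interpret_york(region: str, threshold: float = 0.5) -> List[str]:
    """Return the readings of "york" worth retrieving for this region."""
    p = P_YORK_MEANS_NEW_YORK.get(region)
    if p is None or p == threshold:
        # Unknown region or an even split: keep both readings.
        return ["york", "new york"]
    return ["new york"] if p > threshold else ["york"]


if __name__ == "__main__":
    for region in ("latin_america", "france", "uk"):
        print(region, interpret_york(region))
```

Returning both readings for the ambiguous case leaves the decision to later ranking stages instead of forcing an early guess.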
Query Semantics
• “New” appears alone a lot.
• “York” sometimes appears without “New”.
• The combinations of phrases in documents help search engines relate these things to each other, or differentiate them.
• How documents use the query phrases determines how people search.
• How people search affects how people use query phrases.
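One common way to measure how strongly two words belong together in documents is pointwise mutual information (PMI) over adjacent word pairs. The sketch below is an assumption about how such a signal could be computed, not the method any particular search engine uses; the corpus `DOCS` and the function `bigram_pmi` are made up for illustration.

```python
import math
from collections import Counter

# Tiny made-up corpus; a real index would hold far more documents.
DOCS = [
    "new york hotels near central park",
    "new hotels in york england",
    "cheap flights to new york",
    "new deals on york hotels",
]


def bigram_pmi(documents, first, second):
    """PMI (log base 2) of the adjacent pair (first, second) in the documents."""
    unigrams = Counter()
    bigrams = Counter()
    for doc in documents:
        tokens = doc.lower().split()
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))  # adjacent pairs only

    pair_count = bigrams[(first, second)]
    if pair_count == 0:
        return float("-inf")  # the pair never occurs next to each other

    p_pair = pair_count / sum(bigrams.values())
    p_first = unigrams[first] / sum(unigrams.values())
    p_second = unigrams[second] / sum(unigrams.values())
    return math.log2(p_pair / (p_first * p_second))


if __name__ == "__main__":
    print(bigram_pmi(DOCS, "new", "york"))
    print(bigram_pmi(DOCS, "new", "hotels"))
```

In this toy corpus “new york” scores higher than “new hotels”, which is the kind of evidence that lets a system treat “New York” as one unit while still seeing “New” and “York” as separate words elsewhere.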
Query Semantics
• Bonus: Is it worth indexing?
• Even if 1,000,000 searches happen every day?
• What are the synonyms of facial expressions?

Query Semantics
• “Prove the cost is worth it”.
• Are you worth that cost if you do not use lexicosemantics?
Let’s talk about “porn”.
• This is Matt Cutts.
• His first big task at Google was finding “spammy”, and sometimes not spammy but highly “sexual”, queries.
• Why?
• S A F E S E A R C H.
Let’s talk about “porn”.
• And how do you find all this porn?
• How do people search for porn?
• Matt Cutts was an expert on web spam, because adult websites use spam a lot.
• “Think two times if your manager asks you what you think about porn.”
• -Matt Cutts
Let’s talk about “porn”.
• Matt Cutts used 69 languages, and synonyms, to find good phrases that can relate to porn.
• “I didn’t think about this before. People search porn with lots of different weird words.”
• Matt Cutts tried to convince Google employees to search for porn in weird ways.
• He distributed “cookies”; this is how the “Google Cookie Porn” events happened.
• Lexicosemantics and Query Semantics were tested for the first time across all of Google.

Some Case Studies
http://ktg.digital/Holistic-SEO-20
• Kanbanize.com
• TheCooList.com
• TheComplaintsBoard.com
• Vava.cars

Some Case Studies
http://ktg.digital/Holistic-SEO-20
• Diyetkolik.com
• NDA.
• K9web.com

Paid Media Targeting in a Cookieless Future - Kevin Lee
 
Revolutionizing Advertising with Billion Broadcaster Standee Screen Media
Revolutionizing Advertising with Billion Broadcaster Standee Screen MediaRevolutionizing Advertising with Billion Broadcaster Standee Screen Media
Revolutionizing Advertising with Billion Broadcaster Standee Screen Media
 
1704373070-KIM_-_ITI_ELSS_Tax_Saver_Fund.pdf
1704373070-KIM_-_ITI_ELSS_Tax_Saver_Fund.pdf1704373070-KIM_-_ITI_ELSS_Tax_Saver_Fund.pdf
1704373070-KIM_-_ITI_ELSS_Tax_Saver_Fund.pdf
 
An Odyssey into Composable Digital Solutions - Brian McKeiver
An Odyssey into Composable Digital Solutions - Brian McKeiverAn Odyssey into Composable Digital Solutions - Brian McKeiver
An Odyssey into Composable Digital Solutions - Brian McKeiver
 
Traditional Foods Of Australia and The History
Traditional Foods Of Australia and The HistoryTraditional Foods Of Australia and The History
Traditional Foods Of Australia and The History
 
Chemical Industry- Rashtriya Chemical Fertilizers (RCF) .pptx
Chemical Industry- Rashtriya Chemical Fertilizers (RCF) .pptxChemical Industry- Rashtriya Chemical Fertilizers (RCF) .pptx
Chemical Industry- Rashtriya Chemical Fertilizers (RCF) .pptx
 
TAM AdEx-Quarterly Report on Radio Advertising_2024.pdf
TAM AdEx-Quarterly Report on Radio Advertising_2024.pdfTAM AdEx-Quarterly Report on Radio Advertising_2024.pdf
TAM AdEx-Quarterly Report on Radio Advertising_2024.pdf
 
PPC and SEO Synergies - Strategies Every Company Should Deploy - Benjamin Lund
PPC and SEO Synergies - Strategies Every Company Should Deploy - Benjamin LundPPC and SEO Synergies - Strategies Every Company Should Deploy - Benjamin Lund
PPC and SEO Synergies - Strategies Every Company Should Deploy - Benjamin Lund
 

Lexical Semantics, Semantic Similarity and Relevance for SEO

  • 1. Lexical & Query Semantics Differences for Information Retrieval Why PageRank is Sometimes Better for Semantics
  • 2. Closing the Gap between Search Query Language and Document Language • Information Retrieval Systems have three components. • Query Understanding • Document-Query Relevance Understanding • Document Clustering and Ranking • The path from a “search query” to a “search document” involves query parsing, processing, augmenting, scoring, ranking and clustering. • Query Understanding is where SEO starts. • Document Creation is where SEO continues. • Document Ranking is where SEO repeats itself. Source: Query Language Determination Using Query Terms and Interface Language
  • 3. What is this Search Query Language? • Search Query Language was invented in the Cranfield Experiments in the late 1950s. • Scientists realized that while “querying a document”, the language gets denser, and words change their meaning. • There is a huge vocabulary difference between “queries” and “documents”. • People do not know what to ask a search engine; they only know what represents the topic. • The “query language” uses “knowledge representation” with “dense vectors”. • Query Term Weight Calculation was born during these experiments. Source: Augmenting Queries With Synonyms From Synonyms Map
  • 4. Query Search Language • The Cranfield Experiments, run by Cyril W. Cleverdon, are among the first Information Retrieval experiments. • They were designed to test the efficiency of indexing systems. • Vannevar Bush’s paper “As We May Think” is cited in the research. • The Cranfield Experiments invented the “Search Language” concept to acknowledge that words change their meanings inside search queries, even if they are used in the same way inside the document. • Information Retrieval has to make a distinction between “understanding relevance” and “understanding the query”. • To understand the query, a search engine can’t use the language model it uses for understanding documents. • Document language and query language are completely different. • Inside the documents, we see “lexical semantics”. • Inside the queries, we see “query semantics” with “search language”. Source: “As We May Think”, Vannevar Bush; Cranfield Experiments, Cyril W. Cleverdon, 1958.
  • 5. An Algorithm doesn’t have to be liked by your logic • An algorithm doesn’t have to make sense. • An algorithm has to be useful. • The Cranfield Experiments have been debated for decades, and they are still cited by new research. • The Cranfield Experiments do not explain why their method works; they only show that it works. • The experiments asked test subjects to take documents on the “aerospace” topic and write “keywords” or “search queries” for that topic. • Test subjects ranked the documents based on their own query terms and their own judgement. • The Cranfield Experiments created the concepts of “search language” and “document language”, along with the “natural language query”. Source: Query Generation Using Structural Similarity Between Documents
  • 6. Lexical Semantics • Lexicosemantics involves word-sense disambiguation together with word compositionality and the syntax-semantics interface of the language. • Lexicosemantics supports Formal Semantics (Natural Language). • Formal Semantics studies the grammatical meaning of natural language with theoretical computer science. • Lexical Semantics helps in the construction of WordNets, FrameNets, Knowledge Bases and Index Tiers. • Lexical Semantics is useful for Search Engines when they process a text item to understand the “Semantic Scope” of sentences with “modality”, “tense”, “binding”, “aspect”, and pragmatics. • Lexical semantics involves hyponymy, hypernymy, antonymy, homonymy, polysemy, meronymy, holonymy and semantic networks. Source: Query Generation Using Structural Similarity Between Documents
  • 7. Do You Remember the Google Merger? • What if Google became a semantic search engine by buying another one? • Oingo was the first search engine focused on meaning-based relevance and advertisement. • It became “Applied Semantics” in 2001. • Google and Applied Semantics merged on 18 April, 2003.
  • 8. Applied Semantics (Oingo): The First Conceptual Search Engine • Applied Semantics was created by Eytan Elbaz in 1999. • Information Extraction and Information Responsiveness work differently than Information Retrieval. • Lexical Relations do not carry the meaning of query terms, but Query Semantics does. Thus, query semantics was used for the first time to augment and expand a query. • It is one of the first designs that mentions “semantic distance” and “relationship strength” to create a semantic network of concepts. • It paved the way to “Index Tiering”. Typically, search engines match the search terms to the documents as a whole. If the user is interested in specific information, for example, “sharks”, but a particular document about “beaches around the world”, for example, only has one sentence about sharks, it is unlikely that the search engine would return the document. Documents like the one described are likely to score very low under the query for “sharks”, if at all, because the document as a whole is not “about” sharks. Source: Methods and systems for detecting and extracting information
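The “sharks” example can be illustrated with a toy score. This is a minimal sketch under stated assumptions, not the method from the cited patent: the sample document, the `term_share` signal (share of tokens equal to the term) and the sentence splitter are all hypothetical. It only shows why scoring a passage separately can surface a sentence that the whole-document score buries.

```python
import re

def term_share(text: str, term: str) -> float:
    """Fraction of tokens in `text` equal to `term` (a toy relevance signal)."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return tokens.count(term) / len(tokens) if tokens else 0.0

def best_passage(document: str, term: str) -> tuple[float, str]:
    """Score every sentence on its own and return the best (score, sentence)."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    return max((term_share(s, term), s) for s in sentences)

# Hypothetical document about beaches with a single sentence on sharks.
doc = (
    "Beaches around the world attract millions of visitors. "
    "Some beaches are known for surfing and others for calm water. "
    "Sharks are sometimes seen near the reef. "
    "Most visitors come for the sand and the sun."
)

doc_score = term_share(doc, "sharks")                    # diluted by the whole text
passage_score, passage = best_passage(doc, "sharks")     # concentrated in one sentence
print(round(doc_score, 3), round(passage_score, 3), passage)
```

The passage score is higher than the document score because the single relevant sentence is no longer averaged against the rest of the text.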
  • 9. Do You Remember the Google Merger? • Similarity (“gluttonous” is similar to “greedy”): Near Synonyms • Membership (“commissioner” is a member of “commission”) • Meronymy (whole/part relations) (“motor vehicle” has part “clutch pedal”) • Substance (e.g. “lumber” has substance “wood”) • Product (e.g. “Microsoft Corporation” produces “Microsoft Access”) • Attribute (“past”, “preceding” are attributes of “timing”) • Causation (e.g. travel causes displacement/motion) • Entailment (e.g. buying entails paying) • Lateral bonds (concepts closely related to one another, but not in one of the other relationships, e.g. “dog” and “dog collar”) • Capitonyms (Polish (nation), polish (shining)) • Troponym (Walking -> Hustle, Trot, Crawl) • Eponym (Tommy John Surgery, Biswanath Panda -> Panda Update) • Demonym (New Yorkers -> Population of New York City, not State) • Acronyms (NASA, North American Saxophone Alliance, National Auto Sport Association, National Association of Students of Architecture) • Source: Bill Slawski
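A semantic network with “relationship strength” and “semantic distance” can be sketched as a weighted graph. The example below is an assumption-heavy illustration: the strength values, the “leash” edge and the choice of `1 / strength` as edge cost are all hypothetical and are not taken from Applied Semantics. Distance is the cheapest path between two concepts, found with Dijkstra's algorithm.

```python
import heapq

# (concept_a, concept_b, relation, strength); strengths in (0, 1] are made up.
EDGES = [
    ("gluttonous", "greedy", "similarity", 0.9),
    ("motor vehicle", "clutch pedal", "has_part", 0.6),
    ("lumber", "wood", "substance", 0.8),
    ("dog", "dog collar", "lateral_bond", 0.5),
    ("dog collar", "leash", "lateral_bond", 0.5),  # hypothetical extra edge
]

def build_graph(edges):
    """Undirected adjacency list; a stronger relation gives a cheaper edge."""
    graph = {}
    for a, b, _relation, strength in edges:
        cost = 1.0 / strength
        graph.setdefault(a, []).append((b, cost))
        graph.setdefault(b, []).append((a, cost))
    return graph

def semantic_distance(graph, start, goal):
    """Cheapest path cost between two concepts, or infinity if unconnected."""
    best = {start: 0.0}
    queue = [(0.0, start)]
    while queue:
        dist, node = heapq.heappop(queue)
        if node == goal:
            return dist
        if dist > best.get(node, float("inf")):
            continue  # stale queue entry
        for neighbour, cost in graph.get(node, []):
            new_dist = dist + cost
            if new_dist < best.get(neighbour, float("inf")):
                best[neighbour] = new_dist
                heapq.heappush(queue, (new_dist, neighbour))
    return float("inf")

graph = build_graph(EDGES)
print(semantic_distance(graph, "dog", "leash"))
print(semantic_distance(graph, "gluttonous", "wood"))
```

Concepts joined by strong relations end up close together, while concepts with no connecting path get an infinite distance.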
  • 10. Formal Semantics • Formal Semantics brings philosophy of language and linguistics together. • Denotations of natural language expressions are used to understand the compositionality of words and their references. • The nature of meaning is the philosophical part of formal semantics. • The nature of meaning involves the meanings that come from our nature (Constructivist, Coherence, Correspondence, Consensus, Pragmatic Theories). • Formal Semantics has two approaches. • Truth Conditions • Compositionality • Formal Semantics is related to Lexical Semantics because the compositionality and truth conditions change based on lexical relations.
  • 11. Formal Semantics and Inquisitive Semantics • Inquisitive Semantics involves raising new but related issues about a truth value. • For example: “Aspirin is used against headache. Does it work against toothache?” • “Toothache” and “headache” here have lexical relations to each other as “meronyms”. • Formal Semantics here helps to understand the truth value of “Aspirin” and its functions. • Formal Semantics and Truth Conditions have two approaches. • Dynamic Semantics: The raised issue has to change the context, and the first premise has to be correct. • Static Semantics: The raised issue doesn’t have to be relevant, and the premise doesn’t have to be true. • For example: “John gives SEO suggestions as a Googler. Does John give useful SEO suggestions as a Googler?” • Technically, John’s occupation is not connected to the usefulness of the suggestions. • Dynamic Semantics changes the context of the previous sentence based on the interpreter and the receiver. • Multi-stage or chained reasoning is highly relevant to Dynamic Semantics for “context direction”. Source: Multi-level Recommendation Reasoning over Knowledge Graphs with Reinforcement Learning
  • 12. Formal Semantics and Compositionality • Compositionality is about understanding the lexical relations between subjects and objects. • The easiest way to get a formal semantics understanding of compositionality is to remove all the meaningful lexical units from the sentence. • Take the sentence “Contadu is the best technology for creating a semantical understanding to optimize content”. • It becomes “C is t-t for s-u to o-c”. • The structure here gives the composition of the words, and shows how lexical relations are constructed with constituent rules. Source: Compositionality by Henk J. Verkuyl, Utrecht University
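The skeleton idea can be approximated in a few lines. This is a rough sketch, assuming a small hand-written list of function words (`FUNCTION_WORDS` is hypothetical): every content word is reduced to its initial while function words are kept. It works word by word, so its output is more fine-grained than the phrase-level “C is t-t for s-u to o-c” on the slide.

```python
# Hypothetical, deliberately tiny list of function words to keep as-is.
FUNCTION_WORDS = {"is", "the", "a", "an", "for", "to", "of", "in", "and"}

def skeleton(sentence: str) -> str:
    """Keep function words, reduce every other word to its first letter."""
    out = []
    for word in sentence.rstrip(".").split():
        out.append(word.lower() if word.lower() in FUNCTION_WORDS else word[0])
    return " ".join(out)

sentence = ("Contadu is the best technology for creating "
            "a semantical understanding to optimize content")
print(skeleton(sentence))  # C is the b t for c a s u to o c
```

What remains is the frame of the sentence: the slots that content words fill and the function words that relate them.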
  • 13. Formal Semantics and Scope • Scope determines the range over which a specific declaration is valid. • Formal semantics helps machines to process human language and understand the specific scope. • For example: • “Every student has a favourite teacher”. -> It is not clear whether every student has the same favourite teacher, whether all of them have different favourite teachers, or whether some share a favourite teacher and some do not. • “When three more votes are taken from the court, the decision will be as we want.” -> The unclear part here is why 3, and which 3. Does the court have different layers of officials with different vote values, or do specifically the officials “X, Y, Z” need to vote, and which other decision-makers are against the decision that the person wants? -> This is an example of Inquisitive Semantics; use it for question generation. • There are other types of scopes, such as “scope islands” and “exceptional scopes”. Source: Context-Sensitivity and Individual Differences in the Derivation of Scalar Implicature
  • 14. Formal Semantics and Scope • Scope Islands are called islands because nothing can be taken out of that scope (island). • For example: “If every elephant in the sanctuary gains 5 pounds every 6 months, I will get a promotion”. -> The person doesn’t get another promotion each time an elephant gains 5 pounds in 6 months. It happens once. • Exceptional Scope reverses the scope island with the indefinite article “a”. • For example, “If an elephant gains 5 pounds, I will get a promotion”. -> Ambiguity and repetition occur together. • Scope is important for Compositionality, and Compositionality is important for Lexical Semantics. Source: Creation of inferred queries for use as query suggestions
  • 15. Formal Semantics and Modality • Modality is part of Formal Semantics, together with propositional content and philosophical logic. There are different modalities: • Permissible: Expresses the acts that are allowed. • Possible: Expresses the acts that are possible. • Quintessential: Expresses the features of the acts. • Evidential: Expresses facts with a factual source. • Habitual: Expresses habits. • Iterative: Expresses repeated acts. • Frequentative: Expresses permanent facts. Source: Semantic frame identification with distributed word representations
  • 16. Formal Semantics and Binding • Binding creates a bond between the predicate and the subject. Anaphors are used to express the connections between bound predicates and subjects. • Modality expresses the features of lexical relations, while binding expresses their direction. • In the sentence “Nancy Pelosi must be next presidential candidate for her career”, “must be” involves a “strong possibility”, while “career” is bound to “Nancy Pelosi”. • Set theory works here to create a set of “People who must be next candidates for presidential election”, and “being a presidential candidate” becomes a possible “political career improvement” act. “Presidential candidate” becomes a topic that involves connections to other types of “candidacies”, while “political career steps” and “political discussions” are connected to it. • Binding and modality work together to create an Information Graph. • If the sentence changes to “Nancy Pelosi is the best possible candidate for every democrat in the US.”, the sentence has a possibility from a different “modality”, and the concept of “scope” works here again. • The declaration tells that “Nancy Pelosi is a candidate” for “every Democrat in the US”. This explains the “scope” and “compositionality”. • The compositionality here is “N is a c for e d in the U.S”. • The main issue here is that the scope doesn’t make sense. If a Democrat goes outside of the US, does it mean that “Nancy Pelosi is suddenly not the best candidate” anymore? Or is she literally the best candidate for every Democrat? • Thus, the scope here affects the “modality” further, and makes the “possibility” “opinionated” rather than a “factual possibility”. • The Formal Semantics components affect each other. • The output of Formal Semantics affects Lexical Semantics. • Lexical Semantics affects the Lexical Relations. • Lexical Relations affect the Information Graph and Extraction. • Information Extraction determines the Knowledge Base (Raw Knowledge Graph). Source: Providing result-based query suggestions
  • 17. Formal Semantics and T-A-M (Tense-Aspect-Mood) • Tense-aspect-mood has different combinations that help to extract information and relate lexicosemantics to each other within a data graph. • Tense involves the position of the action inside the timeline. • Past, Present, Future • Aspect involves the extension of the state of the action in the timeline. • Unitary: Happened once and suddenly. • Continuous: Happens over a period of time. • Repeated: Happened repeatedly, will happen again. • Mood (modality) involves the actuality of the action. • Possibility: Might happen. • Necessity: Should happen. Source: Extracting Semantic Classes from Text
  • 18. Transition from Lexical Semantics to Query Semantics • Query Semantics and Lexical Semantics are different from each other but highly similar. • Words that are not synonyms lexically might appear irrelevant to each other, while in Query Semantics they are relevant. • For example, “Buy” and “Sell” are opposites, or antonyms of each other. • In Query Semantics, “Buy” and “Sell” are synonyms; in other words, they mean the same thing. • “Soft Drinks” is a different concept than “Coca Cola”. “Soft Drinks” is a hypernym of Coca Cola in Lexical Semantics, but in Query Semantics, they might be synonyms.
  • 19. Transition from Lexical Semantics to Query Semantics • Query Semantics is used for “Query Inference” and “Query Phrasification”. • The query “Best temperature for Soft Drink” is a query for a hypernym in Lexical Semantics. • Query Semantics is used to generate the same search query for other members of the same set, because they are also synonyms in query semantics. • “Soft drinks such as Coca Cola” and “Coca Cola (Soft Drink)” do not represent the same thing in Query Semantics. • The second phrase is more relevant to “Coca Cola”, while the first one is more relevant to the entire “class of things”.
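Generating the same query for other members of a set can be sketched as a simple substitution. The class table below is a hypothetical stand-in for a knowledge base (“fanta” is an illustrative addition), and plain substring replacement is a simplifying assumption, not a description of how any search engine rewrites queries.

```python
# Hypothetical class membership; a real system would read this from a knowledge base.
CLASS_MEMBERS = {"soft drink": ["coca cola", "pepsi", "fanta"]}

def sibling_queries(query: str) -> list[str]:
    """Rewrite the query for the class and for every other member of the class."""
    q = query.lower()
    variants = []
    for cls, members in CLASS_MEMBERS.items():
        terms = [cls] + members
        present = next((t for t in terms if t in q), None)
        if present is None:
            continue  # the query mentions neither the class nor a member
        variants += [q.replace(present, t) for t in terms if t != present]
    return variants

print(sibling_queries("Best temperature for Soft Drink"))
print(sibling_queries("best temperature for pepsi"))
```

A hypernym query expands to its members, and a member query expands to its siblings plus the class itself.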
  • 20. Transition from Lexical Semantics to Query Semantics • The query “Best temperature for pepsi” requires further query processing with lexicosemantics and query semantics. • “Best temperature for pepsi” has a missing part. • For drinking • For serving • For producing • For storing • For mixing • All the possible “verbs” come from “lexical semantics” and from how they are used in the “query search” language.
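Filling the missing part of “best temperature for pepsi” can be sketched as candidate generation. The purposes come from the slide; the weights in `PURPOSES` are made-up placeholders standing in for whatever usage statistics a real system would have, and `complete_query` is a hypothetical helper.

```python
# Purposes from the slide; the weights are illustrative placeholders, not data.
PURPOSES = {"drinking": 0.4, "serving": 0.25, "storing": 0.2,
            "mixing": 0.1, "producing": 0.05}

def complete_query(query: str, marker: str = " for ") -> list[str]:
    """Insert each candidate purpose after the last 'for', most likely first."""
    head, sep, entity = query.rpartition(marker)
    if not sep:
        return [query]  # nothing to complete
    ranked = sorted(PURPOSES, key=PURPOSES.get, reverse=True)
    return [f"{head}{marker}{verb} {entity}" for verb in ranked]

for candidate in complete_query("best temperature for pepsi"):
    print(candidate)
```

Each candidate is a more specific query that the short original could stand for.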
  • 21. Formal Semantics and T-A-M (Tense-Aspect-Mood) • Formal Semantics and T-A-M affect lexical semantics. • The “tense”, “aspect” and “mood” combinations create different lexical relations with contexts.
  • 22. Transition from Lexical Semantics to Query Semantics • The smallest query and word differences can create ranking changes, even if the search intent is the same or the queries mean the same thing. • Example queries: “what should happen to someone who has hemophilia”, “what can happen to someone who has hemophilia”, “what happens to someone who has hemophilia”. Source: Compositionality by Henk J. Verkuyl, Utrecht University
  • 23. Formal Semantics and T-A-M (Tense-Aspect-Mood) • The modality “should” represents a responsibility and a solution for a problem. • Thus, the result focuses on “treatment” or “precaution”, even if the rest of the sentence is the same. • Example queries: “what should not happen to someone who has hemophilia”, “what will not happen to someone who has hemophilia”, “what happened to someone who has hemophilia”.
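The tense and mood differences between these queries can be made explicit with a very crude tagger. This is a sketch under loud assumptions: the modal table is hypothetical, “past” is guessed from an “-ed” ending, and the label “actual” is an invented default for queries without a modal. Real systems would use a proper morphological analysis.

```python
import re

# Hypothetical mapping from modal verbs to the moods named on the slides.
MOOD_BY_MODAL = {"should": "necessity", "must": "necessity",
                 "can": "possibility", "might": "possibility", "may": "possibility"}

def tam_features(query: str) -> dict:
    """Rough tense, mood and negation features for a short query."""
    tokens = re.findall(r"[a-z]+", query.lower())
    modal = next((t for t in tokens if t in MOOD_BY_MODAL), None)
    if "will" in tokens:
        tense = "future"
    elif any(t.endswith("ed") for t in tokens):  # crude past-tense guess
        tense = "past"
    else:
        tense = "present"
    return {"tense": tense,
            "mood": MOOD_BY_MODAL.get(modal, "actual"),
            "negated": "not" in tokens}

for q in ("what should not happen to someone who has hemophilia",
          "what will not happen to someone who has hemophilia",
          "what happened to someone who has hemophilia"):
    print(q, "->", tam_features(q))
```

Queries that share every content word still end up with different feature sets, which is one way to see why they can rank differently.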
  • 24. Formal Semantics and T-A-M (Tense-Aspect-Mood) • Lemmatization of forms such as “effected” and “effective” brings the answers closer. • The predicate “show” is closer to “demonstrate”, and to “metrics” or “tests”. • The predicates and possible compositionalities have different types of themes. • Example queries: “what shows happen to someone who has hemophilia”, “what effected to someone who has hemophilia”.
  • 33. Query Semantics • We also see that “Cat” and “Dog” can be synonyms. • “Part-time” and “Full-time” can be synonyms. • But sometimes they are not synonyms. • For the query “find job”, they might be synonyms. • For the query “buy pet”, they might be synonyms. • But for “dog food”, they are not synonyms. • “Sign in” and “Sign on” might or might not be synonyms. • “Address” might mean contact information, or just the address.
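Context-dependent synonymy can be modelled by attaching the query context to each pair. The table below is a hypothetical hand-written example built from the slide's cases; a real system would learn such contexts from query logs and documents.

```python
# Hypothetical table: a pair is interchangeable only in the listed query contexts.
CONTEXT_SYNONYMS = {
    frozenset({"cat", "dog"}): {"buy pet"},
    frozenset({"part-time", "full-time"}): {"find job"},
}

def are_query_synonyms(a: str, b: str, context: str) -> bool:
    """True if `a` and `b` can stand in for each other in this query context."""
    return context in CONTEXT_SYNONYMS.get(frozenset({a, b}), set())

print(are_query_synonyms("cat", "dog", "buy pet"))    # True
print(are_query_synonyms("cat", "dog", "dog food"))   # False
```

Using a `frozenset` as the key makes the lookup independent of the order in which the two terms are given.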
  • 34. Query Semantics • New York is not York. • “York Hotels” doesn’t mean “New York Hotels”. • But Vegas is always Las Vegas. • If you search from Latin America, York is New York. • If you search from Africa, York is still New York. • If you search from France, it is 50/50. • If you search from the UK, again, it is not New York.
  • 35. Query Semantics • “New” appears alone a lot. • “York” sometimes appears without “New”. • The combinations of phrases in the documents help search engines to relate these things to each other, or to differentiate them. • How documents use the query phrases determines how people search. • How people search affects how people use query phrases.
  • 36. Query Semantics • Bonus: Is it worth indexing? • Even if 1,000,000 searches happen every day? • What are the synonyms of facial expressions?
  • 37. Query Semantics • “Prove the cost is worth it”. • Are you worth that cost if you do not use lexicosemantics?
  • 38. Let’s talk about “porn”. • This is Matt Cutts. • His first big task at Google was finding “spammy” queries, and sometimes queries that were not spammy but highly “sexual”. • Why? • S A F E S E A R C H.
  • 39. Let’s talk about “porn”. • And how do you find all this porn? • How do people search for porn? • Matt Cutts was an expert on Web Spam, because adult websites use spam a lot. • “Think twice if your manager asks you what you think about porn.” • -Matt Cutts
  • 40. Let’s talk about “porn”. • Matt Cutts used 69 languages and synonyms to find good phrases that relate to porn. • “I didn’t think about this before. People search porn with lots of different weird words.” • Matt Cutts tried to convince Google employees to search for porn in weird ways. • He distributed “cookies”; this is how the “Google Cookie Porn” events happened. • Lexicosemantics and Query Semantics were tested for the first time with the entire Google.