27
December

Amazing developments in AI related to linguistic theory

Now this is fascinating – Dr Daoud Clarke claims, in brand new research paper, to have devised a mathematically-based model that can determine the meaning of words and phrases based on the context in which they are found. His theory allows vectors or numbers to represent words and phrases. The implications of this for companies such as Google who are always looking to increase the relevance and quality of the search results they display to internet users is staggering and will only add to the power they have to determine which sites, articles and words are thematically related.

 

Dr Clarke says:

“There are existing techniques which can build these vectors for words by looking at the contexts they occur in, for example on the web. This works well for words or short phrases, but if you want to extend this to long  or whole sentences, you quickly run out of data, even on the web. Our theory tells you what the vector for a phrase should look like in terms of the vectors for the individual words that make up the phrase.

“For example, at the moment we may have vectors for ‘big’ and ‘cat’, but we don’t know the best way to combine them to get a vector for ‘big cat’,” explained Dr. Clarke. “There are lots of possibilities: for example you could add the two vectors together, but then ‘big cat’ would have to mean the same as ‘cat big’, which doesn’t make sense.

The value of the theory is in identifying which methods of combining vectors do make sense. Our theory will tell you if your method of combining vectors is consistent with the idea that meaning is determined by context.”

According to Dr. Clarke, most existing theories in this field are based around the idea that the meaning of sentences can be represented in terms of logic, but these cannot capture the subtleties of language, such as the relationship between the words “like” and “love”. Representing meanings of words using vectors allows fuzzy relationships between words to be expressed as the distance or angle between the vectors.

Dr. Clarke believes his theory will have many applications in , in particular helping web search engines understand the meaning of your query. “Google works by looking for the words you type in the documents it knows about,” said Dr Clarke. “If you type in a long phrase or sentence, it just tries to match as many words as possible. Imagine how powerful it could be if it understood the meaning of your query, and tried to match it to the closest in all the documents it knows about.”

 

Read more at http://www.physorg.com/news/2011-09-mathematical-enable-web.html

 

No comments yet

23
December

What is linguistics?

Every language is governed by a set of rules that make up the structure, format and composition of the language. It includes how words are formed, how sentences and segments of speech are formed and the system of the audible sounds that make up that language.

This study of all of this is what’s known as “linguistics”

As an academic discipline linguistics encompasses many other areas of speech and language. Some researchers specialise in studying the history, origins and continued evolution of sounds, words and languages. Others concentrate on regional differences (dialects), accents and slang. Many slang words evolve and become an accepted part of that language’s lexicon – eg. recently “texting” “googling” “lol” “wtf” and thousands of other examples.

Another important aspect of the study of linguistics is understanding how meaning is communicated through the use of particular words, intonations, and larger concepts.

The variety of languages around the world is stunning. Differences in grammar, sentence structure, and sounds used to communicate in that language can be bewildering. It is exactly this that inspires linguists to understand more about how we communicate. Why is it that we all communicate so differently and how did that method of communication come to be? Many languages, especially English, have descended from a variety of older languages over thousands of years which is why our language contains so many words from languages such as French, Italian, German, Greek, Latin and more. The study of how particular, individual words were formed is known as “etymology”.

Read more about linguistics at these sites:

http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/index.htm

http://languagelog.ldc.upenn.edu/nll/

No comments yet

     

 
WordPress SEO fine-tune by Meta SEO Pack from Poradnik Webmastera