SlideShare ist ein Scribd-Unternehmen logo
1 von 76
Downloaden Sie, um offline zu lesen
Ve
@KIMTARO
JISHO.ORG
LINGUISTIC
FRAMEWORK
  FOR YOU
“
HOLY SHIT
WHY AREN’T WE USING
MORE LINGUISTICS
? ! ? ! ? ! =! ? ? ! =! ? ! 111o ne
                                  ”
USES?
SEARCH
!ies
!ies
 !y
SENTENCES
The U.S.A.
Ambassador
to S. Korea.
The U.S.A.
Ambassador
to S. Korea.
WORDS
TRANSLITERATIONS
TAGGING
“I like beer„
“I like beer„
   beer
PROBLEMS
:(
STEMMING
!ies
!ies

!i
!ies

!i
LANGAUGE
HAS
EDGE
CASES
IR
REGU
 LAR
   I
 TIES
ENGLISH
 ONLY
PARSERS
ALL. THE.
THINGS.
FUCK YEAH
CRASH
BUGGY
MEMORY
INPUT
OUTPUT
CHARACTER
ENCODINGS
DESTRUCTIVE
CASING
 WHITESPACE
PUNCTUATION
LINGUISTS
  ONLY
DIFFERENT
  PART OF
SPEECH SETS
BROWN CORPUS: 230
IPADIC: 120
飲ん�����動詞
だ ����助動詞
MANAGES
PARSERS
ONE API
 TO RULE
THEM ALL
MULTIPLE
LANGUAGES
UNIFIED
 PART OF
SPEECH SET
18
LEMMA
!ies

!y
YOUR DATA
LOW
 LEVEL
ACCESS
FREELING

Lluís Padró
MECAB
VE CLIENTS         RUBY gem                   JS lib

VE SERVER
                                         SERVER

 INTERFACES   RUBY local        RUBY remote            JS

 MANAGERS       EN             JA          SW          CH

 PROVIDERS


  PARSERS     EN     EN   EN        JA   JA   SW       CH
gem "ve", '0.0.2', :git =>
'git://github.com/Kimtaro/ve.git'
words = Ve.in(:ja).words(input)
words.each do |word|
  word.word
  word.part_of_speech.name
  word.extra[:reading].katakana_to_hiragana
end
#<Ve::Word:0x8edfe24
@word="melons",
@lemma="melon",
@part_of_speech=Ve::PartOfSpeech::Noun,
@tokens=[{
  :raw=>"melons melon NNS 1",
  :type=>:parsed,
  :literal=>"melons",
  :lemma=>"melon",
  :pos=>"NNS",
  :accuracy=>"1",
  :characters=>7..12}],
@extra={:grammar=>:plural},
@info={}>
#<Ve::Word:0x8edfe24
@word="melons",
@lemma="melon",
@part_of_speech=Ve::PartOfSpeech::Noun,
@tokens=[{
  :raw=>"melons melon NNS 1",
  :type=>:parsed,
  :literal=>"melons",
  :lemma=>"melon",
  :pos=>"NNS",
  :accuracy=>"1",
  :characters=>7..12}],
@extra={:grammar=>:plural},
@info={}>
#<Ve::Word:0x8edfe24
@word="melons",
@lemma="melon",
@part_of_speech=Ve::PartOfSpeech::Noun,
@tokens=[{
  :raw=>"melons melon NNS 1",
  :type=>:parsed,
  :literal=>"melons",
  :lemma=>"melon",
  :pos=>"NNS",
  :accuracy=>"1",
  :characters=>7..12}],
@extra={:grammar=>:plural},
@info={}>
#<Ve::Word:0x8edfe24
@word="melons",
@lemma="melon",
@part_of_speech=Ve::PartOfSpeech::Noun,
@tokens=[{
  :raw=>"melons melon NNS 1",
  :type=>:parsed,
  :literal=>"melons",
  :lemma=>"melon",
  :pos=>"NNS",
  :accuracy=>"1",
  :characters=>7..12}],
@extra={:grammar=>:plural},
@info={}>
Ve.in(:ja).sentences(input)
Ve.in(:ja).
transliterate_from_hira_to_latn(input)
<script src="ve.js"></script>
<script>
  Ve('en').words('I like beer',
    function(words) {
      $.each(words, function(i, w) {
           w.lemma;
           w.part_of_speech;
      });
    });
</script>
github.com/Kimtaro/ve

Weitere ähnliche Inhalte

Was ist angesagt?

Tvl simple value winner
Tvl simple value winnerTvl simple value winner
Tvl simple value winner
karenpasacreta
 
Provincia Germán Busch
Provincia Germán BuschProvincia Germán Busch
Provincia Germán Busch
luismarcelo07
 
Factors i conseqüencies de l'abús de l'alcohol
Factors  i conseqüencies de l'abús de l'alcoholFactors  i conseqüencies de l'abús de l'alcohol
Factors i conseqüencies de l'abús de l'alcohol
agalinanes
 
Social Privacy for HTTP over Webfinger
Social Privacy for HTTP over WebfingerSocial Privacy for HTTP over Webfinger
Social Privacy for HTTP over Webfinger
Blaine
 
Betagroup 12 - Board of Innovation
Betagroup 12 - Board of InnovationBetagroup 12 - Board of Innovation
Betagroup 12 - Board of Innovation
Board of Innovation
 

Was ist angesagt? (16)

Marketing 1 182798-1 (1)
Marketing 1 182798-1 (1)Marketing 1 182798-1 (1)
Marketing 1 182798-1 (1)
 
Self Review and Personal Growth
Self Review and Personal GrowthSelf Review and Personal Growth
Self Review and Personal Growth
 
OIF040 HR & Social Media
OIF040 HR & Social MediaOIF040 HR & Social Media
OIF040 HR & Social Media
 
Selfmanagement & Getting Things Done (GTD)
Selfmanagement & Getting Things Done (GTD)Selfmanagement & Getting Things Done (GTD)
Selfmanagement & Getting Things Done (GTD)
 
Tvl simple value winner
Tvl simple value winnerTvl simple value winner
Tvl simple value winner
 
Provincia Germán Busch
Provincia Germán BuschProvincia Germán Busch
Provincia Germán Busch
 
Factors i conseqüencies de l'abús de l'alcohol
Factors  i conseqüencies de l'abús de l'alcoholFactors  i conseqüencies de l'abús de l'alcohol
Factors i conseqüencies de l'abús de l'alcohol
 
Social Privacy for HTTP over Webfinger
Social Privacy for HTTP over WebfingerSocial Privacy for HTTP over Webfinger
Social Privacy for HTTP over Webfinger
 
Intro a Ember.js
Intro a Ember.jsIntro a Ember.js
Intro a Ember.js
 
SK컴즈 행복한 일터를 만드는 사람들
SK컴즈 행복한 일터를 만드는 사람들SK컴즈 행복한 일터를 만드는 사람들
SK컴즈 행복한 일터를 만드는 사람들
 
Cepae 2013
Cepae 2013Cepae 2013
Cepae 2013
 
Derecho colectivo del_trabajo_
Derecho colectivo del_trabajo_Derecho colectivo del_trabajo_
Derecho colectivo del_trabajo_
 
Comunidad educativa pdf
Comunidad educativa pdfComunidad educativa pdf
Comunidad educativa pdf
 
Cancino
CancinoCancino
Cancino
 
Betagroup 12 - Board of Innovation
Betagroup 12 - Board of InnovationBetagroup 12 - Board of Innovation
Betagroup 12 - Board of Innovation
 
Acrostics 2n
Acrostics 2nAcrostics 2n
Acrostics 2n
 

Ve, a linguistic framework.