Basic usage

Synset and Variants

To create a new (empty) synset:

>>> import eurown
>>> a = eurown.Synset()
>>> print a
<eurown.Synset object at 0x80ab10c>
>>> a.polarisText
u'0 WORD_MEANING'
>>> print a.polarisText
0 WORD_MEANING
>>>

The polarisText property returns a (unicode) string representing the Synset in the Polaris import-export format.

A Synset has a part-of-speech property (pos), which can be one of ‘a’, ‘b’, ‘v’ or ‘n’. It is predefined as ‘pn’ when we have a WORD_INSTANCE instead of a WORD_MEANING:

>>> a.pos = 'n'
>>> print a.polarisText
0 WORD_MEANING
  1 PART_OF_SPEECH "n"
>>> b = eurown.WordInstance()
>>> print b.polarisText
0 WORD_INSTANCE
  1 PART_OF_SPEECH "pn"

To create some new variants (each with a literal and a sense number, plus a gloss for var3):

>>> var1 = eurown.Variant(literal='test',sense=1)
>>> var2 = eurown.Variant(literal='trial',sense=1)
>>> var3 = eurown.Variant(literal='test',sense=2)
>>> var3.gloss = u'This is test'
>>> var4 = eurown.Variant(literal='exam',sense=1)

Let’s assign variants var1 and var2 to synset a:

>>> a.variants = eurown.Variants([var1, var2])
>>> print a.polarisText
0 WORD_MEANING
  1 PART_OF_SPEECH "n"
  1 VARIANTS
    2 LITERAL "test"
      3 SENSE 1
    2 LITERAL "trial"
      3 SENSE 1

Next, make a new synset and assign variants var3 and var4 to it:

>>> snset2 = eurown.Synset(pos='n')
>>> snset2.variants = eurown.Variants([var3, var4])
>>> print var3.polarisText
2 LITERAL "test"
  3 SENSE 2
  3 DEFINITION "This is test"

A further variant, var5, can be appended directly to snset2.variants:

>>> snset2.variants.append(eurown.Variant(literal='examination',sense=1))

Now we should have a synset (snset2) with three variants:

>>> print snset2.polarisText
0 WORD_MEANING
  1 PART_OF_SPEECH "n"
  1 VARIANTS
    2 LITERAL "test"
      3 SENSE 2
      3 DEFINITION "This is test"
    2 LITERAL "exam"
      3 SENSE 1
    2 LITERAL "examination"
      3 SENSE 1
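
The polarisText outputs in this document (for example `0 WORD_MEANING` followed by an indented `1 PART_OF_SPEECH "n"`) suggest a simple layout: each line carries its nesting level, a field name and an optional value, indented by two spaces per level. The sketch below reproduces that layout from a nested tuple. It is only an illustration: `render_polaris` and the `(field, value, children)` tuples are hypothetical and are not part of eurown, which may build its text differently.

```python
def render_polaris(node, level=0):
    """Return Polaris-style lines for a (field, value, children) tuple.

    Hypothetical helper for illustration only; not part of eurown.
    """
    field, value, children = node
    # Two spaces of indentation per level, then the level number itself.
    line = '%s%d %s' % ('  ' * level, level, field)
    if value is not None:
        if isinstance(value, int):
            line += ' %d' % value      # numbers are written bare
        else:
            line += ' "%s"' % value    # strings are quoted
    lines = [line]
    for child in children:
        lines.extend(render_polaris(child, level + 1))
    return lines


synset = ('WORD_MEANING', None, [
    ('PART_OF_SPEECH', 'n', []),
    ('VARIANTS', None, [
        ('LITERAL', 'test', [('SENSE', 1, [])]),
        ('LITERAL', 'trial', [('SENSE', 1, [])]),
    ]),
])

text = '\n'.join(render_polaris(synset))
print(text)
```

Keeping the level number on every line means a reader can rebuild the nesting without relying on the indentation.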

Internal Relations

A relation consists of a relation name and a target concept. Let’s make a synset that will be the target of a relation from our snset2:

>>> snset3 = eurown.Synset(pos='n')
>>> var6 = eurown.Variant(literal="communication",sense=1)
>>> var7 = eurown.Variant(literal="communicating",sense=1)
>>> snset3.variants = eurown.Variants([var6,var7])
>>> print snset3.polarisText
0 WORD_MEANING
  1 PART_OF_SPEECH "n"
  1 VARIANTS
    2 LITERAL "communication"
      3 SENSE 1
    2 LITERAL "communicating"
      3 SENSE 1

Now we can link our snset2 to it via a “has_hyperonym” relation:

>>> rel = eurown.Relation(name='has_hyperonym',target_concept=snset3)
>>> snset2.addRelation(rel)
>>> print snset2.polarisText
0 WORD_MEANING
  1 PART_OF_SPEECH "n"
  1 VARIANTS
    2 LITERAL "test"
      3 SENSE 2
      3 DEFINITION "This is test"
    2 LITERAL "exam"
      3 SENSE 1
    2 LITERAL "examination"
      3 SENSE 1
  1 INTERNAL_LINKS
    2 RELATION "has_hyperonym"
      3 TARGET_CONCEPT
        4 PART_OF_SPEECH "n"
        4 LITERAL "communication"
          5 SENSE 1

The addRelation() function gives the same result.

Interlingual Equivalence Links

Interlingual Equivalence link may have

Parsing IO File

Polaris IO files are parsed by Parser. First, we create an instance of the parser:

>>> p = eurown.Parser()

The parser can be given a file name:

>>> p.fileName = 'kb59-utf_8.txt'

We can parse one line, one synset, or even one wordnet file at a time.
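
To make the line and synset granularity concrete, here is a minimal, self-contained sketch of how level-numbered lines like the ones shown in this document could be read. It does not use eurown: `parse_line` and `split_synsets` are hypothetical names, and the assumption that every level-0 line starts a new synset is taken only from the sample outputs above.

```python
import re

# Level number, field name, and an optional value (quoted string or number).
LINE_RE = re.compile(r'^\s*(\d+)\s+([A-Z_]+)(?:\s+(.*\S))?\s*$')


def parse_line(line):
    """Split one line into (level, field, value). Hypothetical helper."""
    m = LINE_RE.match(line)
    if m is None:
        raise ValueError('not a Polaris-style line: %r' % line)
    level, field, raw = int(m.group(1)), m.group(2), m.group(3)
    if raw is None:
        value = None
    elif len(raw) >= 2 and raw.startswith('"') and raw.endswith('"'):
        value = raw[1:-1]              # strip the surrounding quotes
    elif raw.isdigit():
        value = int(raw)
    else:
        value = raw
    return level, field, value


def split_synsets(lines):
    """Group parsed lines into synsets; a level-0 line starts a new one."""
    synsets = []
    for line in lines:
        if not line.strip():
            continue                   # skip blank lines
        record = parse_line(line)
        if record[0] == 0:
            synsets.append([])
        elif not synsets:
            raise ValueError('data does not start with a level-0 line')
        synsets[-1].append(record)
    return synsets


sample = '''0 WORD_MEANING
  1 PART_OF_SPEECH "n"
  1 VARIANTS
    2 LITERAL "test"
      3 SENSE 2
0 WORD_INSTANCE
  1 PART_OF_SPEECH "pn"
'''

synsets = split_synsets(sample.splitlines())
print(len(synsets))
print(synsets[0][3])
```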

WordNet

The module can deal with more than one wordnet at a time. When instantiating a wordnet, we should give it a file name and build all the necessary indexes. Building the indexes may take some time:

>>> wn = eurown.WordNet(name='et', ioFileName='kb59-utf_8.txt')
>>> wn.make_indexes()
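
The example script in this document looks a literal up in `wn.literalIndex` and passes each result to `parse_synset(offset=...)`, which suggests that the index maps a literal to the file offsets of its synsets. The sketch below shows one way such an index could be built. It is an assumption about the idea, not eurown's actual implementation: `build_literal_index` is a hypothetical function, and it works on an in-memory byte stream instead of a real wordnet file.

```python
import io
import re

# Level number, field name, and an optional quoted value.
LINE_RE = re.compile(br'^\s*(\d+)\s+([A-Z_]+)(?:\s+"(.*)")?')


def build_literal_index(stream):
    """Map each variant literal to the byte offsets of its synsets.

    Hypothetical sketch; eurown's make_indexes() may work differently.
    """
    index = {}
    synset_offset = None
    in_variants = False
    while True:
        offset = stream.tell()         # position of the line about to be read
        line = stream.readline()
        if not line:
            break
        m = LINE_RE.match(line)
        if m is None:
            continue
        level, field = int(m.group(1)), m.group(2)
        if level == 0:
            synset_offset = offset     # a new synset starts here
            in_variants = False
        elif level == 1:
            # Only literals under VARIANTS belong to the synset itself;
            # literals under INTERNAL_LINKS name the target concept.
            in_variants = (field == b'VARIANTS')
        elif (field == b'LITERAL' and in_variants
              and m.group(3) is not None and synset_offset is not None):
            literal = m.group(3).decode('utf-8')
            offsets = index.setdefault(literal, [])
            if synset_offset not in offsets:
                offsets.append(synset_offset)
    return index


data = (
    b'0 WORD_MEANING\n'
    b'  1 PART_OF_SPEECH "n"\n'
    b'  1 VARIANTS\n'
    b'    2 LITERAL "test"\n'
    b'      3 SENSE 1\n'
    b'    2 LITERAL "trial"\n'
    b'      3 SENSE 1\n'
    b'0 WORD_MEANING\n'
    b'  1 PART_OF_SPEECH "n"\n'
    b'  1 VARIANTS\n'
    b'    2 LITERAL "test"\n'
    b'      3 SENSE 2\n'
    b'  1 INTERNAL_LINKS\n'
    b'    2 RELATION "has_hyperonym"\n'
    b'      3 TARGET_CONCEPT\n'
    b'        4 PART_OF_SPEECH "n"\n'
    b'        4 LITERAL "communication"\n'
    b'          5 SENSE 1\n'
)

stream = io.BytesIO(data)
index = build_literal_index(stream)

# An offset lets a reader jump straight to the start of a synset.
stream.seek(index['test'][1])
first_line = stream.readline()
print(sorted(index))
```

Storing offsets instead of parsed synsets keeps the index small: a synset is only parsed when a lookup actually needs it, at the cost of a scan over the whole file when the index is built.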

Examples

A script that asks the user for a word to find and prints some basic information (literal, sense, gloss and examples) for each matching synset:

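import eurown

# Load the wordnet and build its indexes (this may take a while).
wn = eurown.WordNet(name='et',
                    ioFileName='kb59-utf_8.txt')

wn.make_indexes()

def test_by_literal(literal):
    """Print every synset that contains the given literal."""
    if literal in wn.literalIndex:
        # Look up the offsets of all synsets that contain the literal.
        snset_offsets = wn.literalIndex[literal]
        for i in snset_offsets:
            print i
            p = eurown.Parser(fileName='kb59-utf_8.txt')
            synset = p.parse_synset(offset=i)
            print 5*'='
            for j in synset.variants:
                print '%s_%d' % (j.literal, j.sense)
                print j.gloss
                print j.examples

def show_synsets():
    """Prompt for words until the script is interrupted."""
    while True:
        a = raw_input('otsi: ')  # 'otsi' is Estonian for 'search'
        test_by_literal(a)

show_synsets()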
