LACS building

From Assothink Wiki

Context

This page is related to the construction of the passive jelly. It has nothing to do with the active jelly.

This page describes a theoretical approach, not the practical software process used to build the Assothink universe.

Concepts and percepts

The passive jelly includes concepts, percepts and variants. Variants will not be discussed here.

As explained elsewhere, concepts

  • are intrinsically unnamed
  • are universal (most of them)
  • are primary objects (in the Assothink model)
  • exist mainly through the links they have with other concepts

On the other hand, percepts

  • are language words
  • are used to represent concepts
  • are not universal

A human being uses his brain to

  • manipulate concepts
  • communicate through concepts
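
The distinction between concepts and percepts can be sketched as a small data model. This is only an illustration, not the Assothink implementation: the class names `Concept` and `Percept`, the numeric identifiers, the symmetric links and the sample words are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from itertools import count

# Hypothetical id generator: a concept carries no name, only an identity.
_ids = count(1)


@dataclass
class Concept:
    """An unnamed node that exists mainly through its links to other concepts."""
    cid: int = field(default_factory=lambda: next(_ids))
    links: set = field(default_factory=set)  # ids of linked concepts

    def link(self, other: "Concept") -> None:
        # In this sketch links are stored symmetrically.
        self.links.add(other.cid)
        other.links.add(self.cid)


@dataclass(frozen=True)
class Percept:
    """A word of one language that represents a concept (not universal)."""
    word: str
    language: str
    concept_id: int


# Illustrative data only.
dog = Concept()
animal = Concept()
dog.link(animal)

percepts = [
    Percept("dog", "en", dog.cid),
    Percept("chien", "fr", dog.cid),
    Percept("animal", "en", animal.cid),
]


def words_for(concept: Concept, language: str) -> list:
    """Return the words that represent a concept in a given language."""
    return [p.word for p in percepts
            if p.concept_id == concept.cid and p.language == language]
```

Here the same concept is reachable from both "dog" and "chien", while the concept itself stores nothing but its links.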

Concept categories

Concepts are organized in categories.

There are many ways to structure concepts into categories.

The Assothink model handles 8 main categories.

On this page, however, we consider only the 4 most common categories:

  • nouns (N) (things)
  • verbs (V) (actions)
  • adjectives (P) (qualifiers for things)
  • adverbs (D) (qualifiers for actions)
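
The one-letter codes above are convenient to handle as an enumeration. The following is a minimal sketch; the names `Category` and `parse_categories` are assumptions for this example, and only the four categories listed on this page are covered.

```python
from enum import Enum


class Category(Enum):
    NOUN = "N"       # things
    VERB = "V"       # actions
    ADJECTIVE = "P"  # qualifiers for things
    ADVERB = "D"     # qualifiers for actions


def parse_categories(codes: str) -> set:
    """Turn a code string such as "NPVD" into a set of categories.

    Raises ValueError on a letter that is not one of the four codes.
    """
    return {Category(letter) for letter in codes}
```

For instance, `parse_categories("NPVD")` yields all four categories, and `parse_categories("N")` yields nouns only.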

LACS

A LACS is a Language-Anchored Concept Space.

It is a set of connected words and concepts.

Examples of LACS (reviewed in more detail below) include:

  • WordNet
  • Wikipedia
  • Wiktionary
  • Freebase

And of course Assothink uses its own LACS, the Assothink LACS.

MoLACS and MuLACS

A MoLACS is a Mono-Language-ACS.

A MuLACS is a Multi-Language-ACS.

LACS review

A LACS may be described using various criteria:

  • categories handled
  • MoLACS or MuLACS
  • concept-centric or percept-centric
  • size

The following table summarizes the properties of the best-known LACS, compared to the Assothink LACS.


LACS       | Categories | Mu or Mo         | Size    | Centric             | Remarks
-----------|------------|------------------|---------|---------------------|---------------------------------------------------------
WordNet    | NPVD       | MoLACS (English) | average | Concept-centric     | The brilliant precursor. Concepts = synsets.
Wikipedia  | N...       | MuLACS           | big     | Word-centric mostly | Brilliant, rich. Poorly organized.
Wiktionary | NPVD       | MuLACS           | big     | Word-centric        | Brilliant, rich. Poorly organized.
Freebase   | N(pvdxxx)  | MuLACS           | huge    | Hybrid              | Anarchic linking. Remarkably exhaustive. Uncontrolled growth.
Assothink  | NPVD       | MuLACS           | small   | Concept-centric     | The best is coming. Only 10K... 30K concepts.

The Assothink LACS is certainly not the biggest, but it is the most demanding in terms of coherence and strength.

It would be the first concept-centric full-NPVD MuLACS, so it is a pioneer.

LACS Integration

The Assothink LACS is not built from scratch.

Other LACS (Freebase typically) grow by absorbing ontologies and other LACS.

The Assothink LACS integrates selected parts of all LACS listed here.

The building process of the Assothink LACS involves two kinds of tasks:

  • interface, analyse, decode, classify, select and filter data from other LACS. This process is heavy in terms of managed data (the full Wikipedia dumps are huge, and Freebase is much bigger still).
  • match concepts present in multiple LACS to create a small but homogeneous set of concepts. The selected concepts are known and referenced in other LACS.
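
The two tasks can be sketched as a filter step followed by a match step. This is not the actual Assothink process: the record layout, the external keys (`wn:dog`, `wp:Dog`, ...) and the function names are made up for the example, and matching on a normalized label is a deliberately naive stand-in for real concept matching.

```python
from collections import defaultdict

# Hypothetical records decoded from each source:
# (source, external key, category code, label)
records = [
    ("wordnet", "wn:dog", "N", "dog"),
    ("wikipedia", "wp:Dog", "N", "Dog"),
    ("freebase", "fb:dog", "N", "dog"),
    ("freebase", "fb:obscure-band", "N", "Obscure Band"),
    ("wiktionary", "wk:run", "V", "run"),
]


def select(records, wanted_categories):
    """Task 1 (simplified): keep only records of the wanted categories."""
    return [r for r in records if r[2] in wanted_categories]


def match(records, min_sources=2):
    """Task 2 (simplified): group records that seem to denote the same
    concept and keep the groups referenced by several sources."""
    groups = defaultdict(dict)
    for source, key, category, label in records:
        groups[(label.strip().lower(), category)][source] = key
    return {k: refs for k, refs in groups.items() if len(refs) >= min_sources}


noun_concepts = match(select(records, {"N"}))
```

With the sample data, only the "dog" noun survives: it is referenced by three sources, whereas "Obscure Band" appears in a single source and is dropped. Each retained concept keeps its references into the other LACS.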

In practice, the integration process used to build the noun part of the Assothink LACS relies mainly on WordNet, Wikipedia and Freebase (and language thesauri).

The integration process used to build the verb, adverb and adjective parts, however, relies mainly on WordNet and Wiktionary.
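
This choice of main sources per category can be written down as a simple lookup table. The names `SOURCES_BY_CATEGORY` and `sources_for` are hypothetical, and the table lists only the main sources mentioned above.

```python
# Main sources per category code (N = nouns, V = verbs,
# P = adjectives, D = adverbs), as described on this page.
SOURCES_BY_CATEGORY = {
    "N": ("wordnet", "wikipedia", "freebase", "thesauri"),
    "V": ("wordnet", "wiktionary"),
    "P": ("wordnet", "wiktionary"),
    "D": ("wordnet", "wiktionary"),
}


def sources_for(category: str) -> tuple:
    """Return the main sources used for one category code."""
    return SOURCES_BY_CATEGORY[category]
```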