Sanskrit and Computational Linguistic




G Jaganadh
Linguist
C-DAC Thiruvananthapuram
Talk Overview
Introduction

Computational Linguistics

Sanskrit in Computational Linguistics

Computational Linguistics for Research in Sanskrit

Towards future
Introduction


Linguistics
   the study of human language, including subjects such as phonology,

 morphology, syntax, semantics, lexicon, sociolinguistics, and

 psycholinguistics
Computational Linguistics

  The interdisciplinary field which involves both linguistics and
computer science, and is concerned with automatising the analysis of
text and speech corpora, developing precise models of grammars
and lexica which can be processed automatically.
Corpus Linguistics

  Corpus linguistics is the study of language as expressed
in samples (corpora) or "real world" text. This method
represents a digestive approach to deriving a set of
abstract rules by which a natural language is governed or
else relates to another language ..
Corpus


  A collection of written and/or spoken language stored on
a computer and used for language research and writing
dictionaries
Sanskrit in Computational
Linguistics



Machine Aided Translation

Knowledge Representation
Machine Translation
The process of automated translation of Natural
Language Texts
Anusaarak
AnglaBharati
Shakti
Knowledge Representation



Rick Briggs Research Findings
Language Tools for Sanskrit

Morphological Analyser – Akshar Bharti, Amba P
Kulakarni and V .Sheeba.
Gerard Huet
Girish Nath Jaha
POS Tagger – Girish Nath Jaha et.all
Parser – Gerard Huet …..
Computational Linguistics for Research in
 Sanskrit


Applications of Computational Linguistics in Sanskrit
Corpus based studies
  Unigram
  Bigram
  Trigram
  Collocation
  Concordance Generation
…….

Phonetics
Ontology
WordNet
Computational Tools for Sanskrit Community




Spell checking system
Grammar Checking System
Vritta analyser
Interactive Sanskrit Learning Kit
Resources for Sanskrit Computational
    Linguistics


   Corpora text and Speech
   Parallel Corpora
Towards future

   Sanskrit Grammarians can      …………

    Inter Disciplinary Approach
   Technological Education for Sanskrit Scholars
   Resource and HR development in Sanskrit Computational
    Linguistics
   Localization in Sanskrit
Useful Links
   sanskrit.inria.fr/ZEN/
   sanskrit.inria.fr/Symposium/
   sanskrit.jnu.ac.in/
   www.experiencefestival.com/sanskrit_-
    _computational_linguistics/articleindex
   ltrc.iiit.net/~anusaaraka/
   sa.wikipedia.org
   http://subhashcdac.blogspot.com/2007/06/blo
    g-post_7632.html
   indology.info/
   nyaya.darsana.org
   We offers
   PGDLT – Post Graduate Diploma in
    Language Technology
   For Language students
   Extensive training in Computational
    Linguistics and Localization
Question ?
Thanks
Mail to me – jaganadhg@gmailc.om

Sanskrit and Computational Linguistic

  • 1.
    Sanskrit and ComputationalLinguistic G Jaganadh Linguist C-DAC Thiruvananthapuram
  • 2.
    Talk Overview Introduction Computational Linguistics Sanskritin Computational Linguistics Computational Linguistics for Research in Sanskrit Towards future
  • 3.
    Introduction Linguistics the study of human language, including subjects such as phonology, morphology, syntax, semantics, lexicon, sociolinguistics, and psycholinguistics
  • 4.
    Computational Linguistics The interdisciplinary field which involves both linguistics and computer science, and is concerned with automatising the analysis of text and speech corpora, developing precise models of grammars and lexica which can be processed automatically.
  • 5.
    Corpus Linguistics Corpus linguistics is the study of language as expressed in samples (corpora) or "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language ..
  • 6.
    Corpus Acollection of written and/or spoken language stored on a computer and used for language research and writing dictionaries
  • 7.
    Sanskrit in Computational Linguistics MachineAided Translation Knowledge Representation
  • 8.
    Machine Translation The processof automated translation of Natural Language Texts Anusaarak AnglaBharati Shakti
  • 9.
  • 10.
    Language Tools forSanskrit Morphological Analyser – Akshar Bharti, Amba P Kulakarni and V .Sheeba. Gerard Huet Girish Nath Jaha POS Tagger – Girish Nath Jaha et.all Parser – Gerard Huet …..
  • 11.
    Computational Linguistics forResearch in Sanskrit Applications of Computational Linguistics in Sanskrit Corpus based studies Unigram Bigram Trigram Collocation Concordance Generation
  • 12.
  • 13.
    Computational Tools forSanskrit Community Spell checking system Grammar Checking System Vritta analyser Interactive Sanskrit Learning Kit
  • 14.
    Resources for SanskritComputational Linguistics  Corpora text and Speech  Parallel Corpora
  • 15.
    Towards future  Sanskrit Grammarians can …………  Inter Disciplinary Approach  Technological Education for Sanskrit Scholars  Resource and HR development in Sanskrit Computational Linguistics  Localization in Sanskrit
  • 16.
    Useful Links  sanskrit.inria.fr/ZEN/  sanskrit.inria.fr/Symposium/  sanskrit.jnu.ac.in/  www.experiencefestival.com/sanskrit_- _computational_linguistics/articleindex  ltrc.iiit.net/~anusaaraka/  sa.wikipedia.org  http://subhashcdac.blogspot.com/2007/06/blo g-post_7632.html
  • 17.
    indology.info/  nyaya.darsana.org
  • 18.
    We offers  PGDLT – Post Graduate Diploma in Language Technology  For Language students  Extensive training in Computational Linguistics and Localization
  • 19.
  • 20.
    Thanks Mail to me– jaganadhg@gmailc.om