SlideShare ist ein Scribd-Unternehmen logo
1 von 10
Downloaden Sie, um offline zu lesen
Developments in Open-Tamil Library
T. Arulalan, T. Shrinivasan+
, and A. Muthiah*
INFIIT 2016, Dindugul, Tamilnadu
+tshrinivasan@gmail.com
*ezhillang@gmail.com
Introduction
● Help create high-level
applications in Tamil
● Fully open-source
● Published and
maintained from 2014
● Available via Python
Package index - pip
● Developed via Git
● Many contributions
from 10 developers
● Library for Tamil text
processing
● Word games
● Encoding conversions
Spell checker based on OpenTamil
Integration of solthiruthi spell-checker from open-tamil with TinyMCE web editor
Work in progress
Word Search Games
Game generated using Open-Tamil library
Find the world leaders
during the 1950s
Document Statistics
$ python solpattiyal.py document.txt
Outputs:
1. Number of unique words in doc
2. Word frequency
3. Words in Tamil dictionary sorted order
Open-tamil ச|வா|வால|வாசல|சவால
● Generate Anagrams
● Generate Combinations of words
● Partial words
● Check if word is a palindrome
import tamil
from solthiruthi.dictionary import *
TVU_dict = DictionaryBuilder.create(TamilVU)
word = u’சவால’
q=list(tamil.wordutils.combinagrams(word,TVU_dict))
print(u'|'.join(q))
which gives you the output, ச|வா|வால|வாசல|சவால
Ref: See ezhillang.wordpress.com blog here
Word Play - Tamil Anagrams
We can compute anagrams in Tamil
e.g. using TVU word list → at ezhillang blog
https://ezhillang.wordpress.com/2015/07/27/open-tamil-anagrams-in-tamil-vu-word-list/
Open-Tamil – Java library
● Available for use in Java
● Build Tamil apps easily
Quality
● Open-Tamil project is developed on www.github.com
● Approximately 16k LOC - (13,579 LOC), solthiruthi (1,594 LOC),
and ngram (187 LOC), in the latest development repository.
● Over 208 unittests (2,705 LOC) that test our source code
modules tamil
● All source code checkin on github trigger the continuous
integration tests via Travis-CI
● Supported Python flavors ( v2.6, v2.7, v3.3, v3.5, and PyPy)
● Manual testing of Java and Ruby tests.
● Github workflow
Conclusions
● Further contributions required to document the
library and write tutorials for us
● Improve quality, test and report bugs
● More students and developers may avail this
library and build high level applications
● Project support and sponsors are sought out

Weitere ähnliche Inhalte

Andere mochten auch

A quick quiz 2011 qs
A quick quiz 2011 qsA quick quiz 2011 qs
A quick quiz 2011 qsNeill Ford
 
Medical quiz in images
Medical quiz in imagesMedical quiz in images
Medical quiz in imageskiranvs123
 
1500 computer awareness_bits_for_ibps
1500 computer awareness_bits_for_ibps1500 computer awareness_bits_for_ibps
1500 computer awareness_bits_for_ibpsRicky Dky
 
Computer fundamentals mcq quiz - Practice and prepare with mcqSets.com
Computer fundamentals mcq quiz - Practice and prepare with mcqSets.comComputer fundamentals mcq quiz - Practice and prepare with mcqSets.com
Computer fundamentals mcq quiz - Practice and prepare with mcqSets.comSuresh Khanal
 
SCIENCE COMPUTER MATH Quiz Bee
SCIENCE COMPUTER MATH Quiz BeeSCIENCE COMPUTER MATH Quiz Bee
SCIENCE COMPUTER MATH Quiz BeeAlvin Ayo-ong
 
Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)
Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)
Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)Suresh Khanal
 
Computer Quiz (August 2013)
Computer Quiz (August 2013)Computer Quiz (August 2013)
Computer Quiz (August 2013)Soham Banerjee
 
Computer quiz for primary classes
Computer quiz for primary classesComputer quiz for primary classes
Computer quiz for primary classesRajashekar_rs
 

Andere mochten auch (9)

A quick quiz 2011 qs
A quick quiz 2011 qsA quick quiz 2011 qs
A quick quiz 2011 qs
 
Medical quiz in images
Medical quiz in imagesMedical quiz in images
Medical quiz in images
 
1500 computer awareness_bits_for_ibps
1500 computer awareness_bits_for_ibps1500 computer awareness_bits_for_ibps
1500 computer awareness_bits_for_ibps
 
Computer fundamentals mcq quiz - Practice and prepare with mcqSets.com
Computer fundamentals mcq quiz - Practice and prepare with mcqSets.comComputer fundamentals mcq quiz - Practice and prepare with mcqSets.com
Computer fundamentals mcq quiz - Practice and prepare with mcqSets.com
 
SCIENCE COMPUTER MATH Quiz Bee
SCIENCE COMPUTER MATH Quiz BeeSCIENCE COMPUTER MATH Quiz Bee
SCIENCE COMPUTER MATH Quiz Bee
 
Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)
Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)
Microsoft Excel MCQ Bank from mcqSets.com (Multiple Choice Questions from Excel)
 
Computer Quiz (August 2013)
Computer Quiz (August 2013)Computer Quiz (August 2013)
Computer Quiz (August 2013)
 
Computer quiz for primary classes
Computer quiz for primary classesComputer quiz for primary classes
Computer quiz for primary classes
 
IT Quiz
IT QuizIT Quiz
IT Quiz
 

Ähnlich wie Developments in Open-Tamil Library for Text Processing and Word Games

Open-Tamil text processing library
Open-Tamil text processing libraryOpen-Tamil text processing library
Open-Tamil text processing libraryezhillang
 
Tamilinayavaani - integrating tva open-source spellchecker with python
Tamilinayavaani -  integrating tva open-source spellchecker with pythonTamilinayavaani -  integrating tva open-source spellchecker with python
Tamilinayavaani - integrating tva open-source spellchecker with pythonShrinivasan T
 
Growth and evolution of Open-Tamil
Growth and evolution of Open-TamilGrowth and evolution of Open-Tamil
Growth and evolution of Open-Tamilezhillang
 
Python Programming Introduction For Students
Python Programming Introduction For StudentsPython Programming Introduction For Students
Python Programming Introduction For StudentsShaunakBale1
 
Python Programming Awarenes for Kids and Non Programmers
Python Programming Awarenes for Kids and Non ProgrammersPython Programming Awarenes for Kids and Non Programmers
Python Programming Awarenes for Kids and Non ProgrammersElaiya Iswera Lallan
 
introduction to Python (for beginners)
introduction to Python (for beginners)introduction to Python (for beginners)
introduction to Python (for beginners)guobichrng
 

Ähnlich wie Developments in Open-Tamil Library for Text Processing and Word Games (9)

Open-Tamil text processing library
Open-Tamil text processing libraryOpen-Tamil text processing library
Open-Tamil text processing library
 
Tamilinayavaani - integrating tva open-source spellchecker with python
Tamilinayavaani -  integrating tva open-source spellchecker with pythonTamilinayavaani -  integrating tva open-source spellchecker with python
Tamilinayavaani - integrating tva open-source spellchecker with python
 
Growth and evolution of Open-Tamil
Growth and evolution of Open-TamilGrowth and evolution of Open-Tamil
Growth and evolution of Open-Tamil
 
Python Programming Introduction For Students
Python Programming Introduction For StudentsPython Programming Introduction For Students
Python Programming Introduction For Students
 
Python Programming Awarenes for Kids and Non Programmers
Python Programming Awarenes for Kids and Non ProgrammersPython Programming Awarenes for Kids and Non Programmers
Python Programming Awarenes for Kids and Non Programmers
 
Python Training in Bangalore
Python Training in BangalorePython Training in Bangalore
Python Training in Bangalore
 
introduction to Python (for beginners)
introduction to Python (for beginners)introduction to Python (for beginners)
introduction to Python (for beginners)
 
Introduction to python
Introduction to pythonIntroduction to python
Introduction to python
 
summer t.pdf
summer t.pdfsummer t.pdf
summer t.pdf
 

Kürzlich hochgeladen

Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Clustering techniques data mining book ....
Clustering techniques data mining book ....Clustering techniques data mining book ....
Clustering techniques data mining book ....ShaimaaMohamedGalal
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 

Kürzlich hochgeladen (20)

Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Clustering techniques data mining book ....
Clustering techniques data mining book ....Clustering techniques data mining book ....
Clustering techniques data mining book ....
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 

Developments in Open-Tamil Library for Text Processing and Word Games

  • 1. Developments in Open-Tamil Library T. Arulalan, T. Shrinivasan+ , and A. Muthiah* INFIIT 2016, Dindugul, Tamilnadu +tshrinivasan@gmail.com *ezhillang@gmail.com
  • 2. Introduction ● Help create high-level applications in Tamil ● Fully open-source ● Published and maintained from 2014 ● Available via Python Package index - pip ● Developed via Git ● Many contributions from 10 developers ● Library for Tamil text processing ● Word games ● Encoding conversions
  • 3. Spell checker based on OpenTamil Integration of solthiruthi spell-checker from open-tamil with TinyMCE web editor Work in progress
  • 4. Word Search Games Game generated using Open-Tamil library Find the world leaders during the 1950s
  • 5. Document Statistics $ python solpattiyal.py document.txt Outputs: 1. Number of unique words in doc 2. Word frequency 3. Words in Tamil dictionary sorted order
  • 6. Open-tamil ச|வா|வால|வாசல|சவால ● Generate Anagrams ● Generate Combinations of words ● Partial words ● Check if word is a palindrome import tamil from solthiruthi.dictionary import * TVU_dict = DictionaryBuilder.create(TamilVU) word = u’சவால’ q=list(tamil.wordutils.combinagrams(word,TVU_dict)) print(u'|'.join(q)) which gives you the output, ச|வா|வால|வாசல|சவால Ref: See ezhillang.wordpress.com blog here
  • 7. Word Play - Tamil Anagrams We can compute anagrams in Tamil e.g. using TVU word list → at ezhillang blog https://ezhillang.wordpress.com/2015/07/27/open-tamil-anagrams-in-tamil-vu-word-list/
  • 8. Open-Tamil – Java library ● Available for use in Java ● Build Tamil apps easily
  • 9. Quality ● Open-Tamil project is developed on www.github.com ● Approximately 16k LOC - (13,579 LOC), solthiruthi (1,594 LOC), and ngram (187 LOC), in the latest development repository. ● Over 208 unittests (2,705 LOC) that test our source code modules tamil ● All source code checkin on github trigger the continuous integration tests via Travis-CI ● Supported Python flavors ( v2.6, v2.7, v3.3, v3.5, and PyPy) ● Manual testing of Java and Ruby tests. ● Github workflow
  • 10. Conclusions ● Further contributions required to document the library and write tutorials for us ● Improve quality, test and report bugs ● More students and developers may avail this library and build high level applications ● Project support and sponsors are sought out