SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Growth and Evolution of Open-
Tamil
INFITT Tamil Internet Conference,
Coimbatore, India - 2018
Syed Abuthahir M, T. Arulalan, Sathia Narayanan,
Surendhar Ravichandran,
A. Arunram, T. Shrinivasan and Muthiah Annamalai
developerabu@gmail.com
Process Tamil Text
Installing
•
Python Package - https://pypi.org/project/Open-Tamil/
•
Git-Hub Collaboration -
https://github.com/Ezhil-Language-Foundation/open-tamil
•
Install Using Command – sudo pip install open-tamil==0.7
Introduction
•
Help create high-level applications in tamil
•
Fully open-source
•
Many contributions from 12 developers
•
Library for Tamil text processing
•
Encoding Conversions
Spell Checker For Tamil
Anagram
What's New?
•
Our new contributors enabled Sandhi checker
•
And new Web interface based on Django for open-tamil hosted at
http://tamilpesu.us
A View of Tamilpesu
Tamilpesu.us
•
Published some of important functions within open-tamil on web-
interface views
•
The web-interface also allow a simple json API as well for the
following actions
1. Word search application -
2. Tamil numeral generator -
3. Tamil sandhi checker -
4. Transliterator for Tamil -
5. Tamil word spelling checker -
6. Unicode encoding converter -)
7. Tamil anagram checker -. etc..
Web interface For tamil numeral
generator
Multiplication Table Generator
Sandhi Checker Web Interface
Word Search
Online Encoding convertor
Json Api service
Commad Line Utilities
•
Few user commands are provided in latest release of Open-Tamil
project for using library functions as command line tools
   1. tamilphonetic 
   2. tamilwordfilter   
   3. tamilurlfilter  
   4. tamiltscii2utf8
  5. tamilwordgrid
  6. tamilwordcount
Quality
•
Open-Tamil project is developed on Github
•
Approximately 22k LOC on this revision.2018 [Git HEAD=ca6a8e19... ]
version = 0.70 of open-tamil
•
We have resolved several bugs and have 84 closed tickets and 57
open tickets at this time.
Machine Learning Applications
•
We expect open-tamil project to grow in additional relevance with
surge in Machine Learning (ML) applications and requirement for
generating features from large data sets [to train the ML models]
•
In this paper we show classification of Tamil words between words
that are natively Tamil or just a direct English transliterations into
Tamil, using the features generated based on open-tamil library. We
found a 95% accuracy and 95% recall in testing dataset for models
using open-tamil and SciKit learn on Python environment
Classifier output and feature vector
•
Demo
Growth and evolution of Open-Tamil

Weitere ähnliche Inhalte

Ähnlich wie Growth and evolution of Open-Tamil

Introduction to Python Programming Basics
Introduction  to  Python  Programming BasicsIntroduction  to  Python  Programming Basics
Introduction to Python Programming BasicsDhana malar
 
Revamping Mailjet API documentation @ ParisAPI meetup
Revamping Mailjet API documentation @ ParisAPI meetupRevamping Mailjet API documentation @ ParisAPI meetup
Revamping Mailjet API documentation @ ParisAPI meetupMailjet
 
Python presentation
Python presentationPython presentation
Python presentationgaganapponix
 
Python programming ppt.pptx
Python programming ppt.pptxPython programming ppt.pptx
Python programming ppt.pptxnagendrasai12
 
Julia Computing - an alternative to Hadoop
Julia Computing - an alternative to HadoopJulia Computing - an alternative to Hadoop
Julia Computing - an alternative to HadoopShaurya Shekhar
 
Generative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlowGenerative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlowGene Leybzon
 
report on internshala python training
 report on internshala python  training  report on internshala python  training
report on internshala python training surabhimalviya1
 
TAUS Webinar - Introduction to the Gengo API Ecosystem
TAUS Webinar - Introduction to the Gengo API EcosystemTAUS Webinar - Introduction to the Gengo API Ecosystem
TAUS Webinar - Introduction to the Gengo API EcosystemGengo
 
Raspberry using Python Session 1
Raspberry using Python Session 1Raspberry using Python Session 1
Raspberry using Python Session 1Mohamed Abd Ela'al
 
Python-Yesterday Today Tomorrow(What's new?)
Python-Yesterday Today Tomorrow(What's new?)Python-Yesterday Today Tomorrow(What's new?)
Python-Yesterday Today Tomorrow(What's new?)Mohan Arumugam
 
How Azure DevOps can boost your organization's productivity
How Azure DevOps can boost your organization's productivityHow Azure DevOps can boost your organization's productivity
How Azure DevOps can boost your organization's productivityIvan Porta
 
Africa Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function ManipulationAfrica Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function ManipulationUiPathCommunity
 
Africa Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function Manipulation Africa Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function Manipulation UiPathCommunity
 
QTP Automation Testing Tutorial 1
QTP Automation Testing Tutorial 1QTP Automation Testing Tutorial 1
QTP Automation Testing Tutorial 1Akash Tyagi
 

Ähnlich wie Growth and evolution of Open-Tamil (20)

Introduction to Python Programming Basics
Introduction  to  Python  Programming BasicsIntroduction  to  Python  Programming Basics
Introduction to Python Programming Basics
 
Revamping Mailjet API documentation @ ParisAPI meetup
Revamping Mailjet API documentation @ ParisAPI meetupRevamping Mailjet API documentation @ ParisAPI meetup
Revamping Mailjet API documentation @ ParisAPI meetup
 
Python Course In Chandigarh
Python Course In ChandigarhPython Course In Chandigarh
Python Course In Chandigarh
 
Python presentation
Python presentationPython presentation
Python presentation
 
Python programming ppt.pptx
Python programming ppt.pptxPython programming ppt.pptx
Python programming ppt.pptx
 
PYTHON UNIT 1
PYTHON UNIT 1PYTHON UNIT 1
PYTHON UNIT 1
 
Julia Computing - an alternative to Hadoop
Julia Computing - an alternative to HadoopJulia Computing - an alternative to Hadoop
Julia Computing - an alternative to Hadoop
 
Generative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlowGenerative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlow
 
Selenium topic 1- Selenium Basic
Selenium topic 1-  Selenium BasicSelenium topic 1-  Selenium Basic
Selenium topic 1- Selenium Basic
 
OpenNTF Essentials
OpenNTF EssentialsOpenNTF Essentials
OpenNTF Essentials
 
report on internshala python training
 report on internshala python  training  report on internshala python  training
report on internshala python training
 
Python training Course in Mohali
Python training Course in MohaliPython training Course in Mohali
Python training Course in Mohali
 
TAUS Webinar - Introduction to the Gengo API Ecosystem
TAUS Webinar - Introduction to the Gengo API EcosystemTAUS Webinar - Introduction to the Gengo API Ecosystem
TAUS Webinar - Introduction to the Gengo API Ecosystem
 
Raspberry using Python Session 1
Raspberry using Python Session 1Raspberry using Python Session 1
Raspberry using Python Session 1
 
Python-Yesterday Today Tomorrow(What's new?)
Python-Yesterday Today Tomorrow(What's new?)Python-Yesterday Today Tomorrow(What's new?)
Python-Yesterday Today Tomorrow(What's new?)
 
What is python
What is pythonWhat is python
What is python
 
How Azure DevOps can boost your organization's productivity
How Azure DevOps can boost your organization's productivityHow Azure DevOps can boost your organization's productivity
How Azure DevOps can boost your organization's productivity
 
Africa Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function ManipulationAfrica Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function Manipulation
 
Africa Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function Manipulation Africa Series 2 Session 3 - Invoke Function Manipulation
Africa Series 2 Session 3 - Invoke Function Manipulation
 
QTP Automation Testing Tutorial 1
QTP Automation Testing Tutorial 1QTP Automation Testing Tutorial 1
QTP Automation Testing Tutorial 1
 

Mehr von ezhillang

Algorithm to Correct Missing Pulli-Signs in Printed Tamil Text
Algorithm to Correct Missing Pulli-Signs in Printed Tamil TextAlgorithm to Correct Missing Pulli-Signs in Printed Tamil Text
Algorithm to Correct Missing Pulli-Signs in Printed Tamil Textezhillang
 
எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும்
எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும் எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும்
எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும் ezhillang
 
Tamil open-source landscape - opportunities and challenges
Tamil open-source landscape - opportunities and challengesTamil open-source landscape - opportunities and challenges
Tamil open-source landscape - opportunities and challengesezhillang
 
Tamil open source_survey_2016_final
Tamil open source_survey_2016_finalTamil open source_survey_2016_final
Tamil open source_survey_2016_finalezhillang
 
Open tamil-infitt-2016 v3
Open tamil-infitt-2016 v3Open tamil-infitt-2016 v3
Open tamil-infitt-2016 v3ezhillang
 
Open-Tamil text processing library
Open-Tamil text processing libraryOpen-Tamil text processing library
Open-Tamil text processing libraryezhillang
 

Mehr von ezhillang (6)

Algorithm to Correct Missing Pulli-Signs in Printed Tamil Text
Algorithm to Correct Missing Pulli-Signs in Printed Tamil TextAlgorithm to Correct Missing Pulli-Signs in Printed Tamil Text
Algorithm to Correct Missing Pulli-Signs in Printed Tamil Text
 
எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும்
எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும் எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும்
எழில் மொழி பயன்பாட்டிற்கும் வெளியீடு நோக்கிய சவால்களும்
 
Tamil open-source landscape - opportunities and challenges
Tamil open-source landscape - opportunities and challengesTamil open-source landscape - opportunities and challenges
Tamil open-source landscape - opportunities and challenges
 
Tamil open source_survey_2016_final
Tamil open source_survey_2016_finalTamil open source_survey_2016_final
Tamil open source_survey_2016_final
 
Open tamil-infitt-2016 v3
Open tamil-infitt-2016 v3Open tamil-infitt-2016 v3
Open tamil-infitt-2016 v3
 
Open-Tamil text processing library
Open-Tamil text processing libraryOpen-Tamil text processing library
Open-Tamil text processing library
 

Kürzlich hochgeladen

(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 

Kürzlich hochgeladen (20)

(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 

Growth and evolution of Open-Tamil

  • 1. Growth and Evolution of Open- Tamil INFITT Tamil Internet Conference, Coimbatore, India - 2018 Syed Abuthahir M, T. Arulalan, Sathia Narayanan, Surendhar Ravichandran, A. Arunram, T. Shrinivasan and Muthiah Annamalai developerabu@gmail.com
  • 3. Installing • Python Package - https://pypi.org/project/Open-Tamil/ • Git-Hub Collaboration - https://github.com/Ezhil-Language-Foundation/open-tamil • Install Using Command – sudo pip install open-tamil==0.7
  • 4. Introduction • Help create high-level applications in tamil • Fully open-source • Many contributions from 12 developers • Library for Tamil text processing • Encoding Conversions
  • 7. What's New? • Our new contributors enabled Sandhi checker • And new Web interface based on Django for open-tamil hosted at http://tamilpesu.us
  • 8. A View of Tamilpesu
  • 9. Tamilpesu.us • Published some of important functions within open-tamil on web- interface views • The web-interface also allow a simple json API as well for the following actions 1. Word search application - 2. Tamil numeral generator - 3. Tamil sandhi checker - 4. Transliterator for Tamil - 5. Tamil word spelling checker - 6. Unicode encoding converter -) 7. Tamil anagram checker -. etc..
  • 10. Web interface For tamil numeral generator
  • 12. Sandhi Checker Web Interface
  • 16. Commad Line Utilities • Few user commands are provided in latest release of Open-Tamil project for using library functions as command line tools    1. tamilphonetic     2. tamilwordfilter       3. tamilurlfilter      4. tamiltscii2utf8   5. tamilwordgrid   6. tamilwordcount
  • 17. Quality • Open-Tamil project is developed on Github • Approximately 22k LOC on this revision.2018 [Git HEAD=ca6a8e19... ] version = 0.70 of open-tamil • We have resolved several bugs and have 84 closed tickets and 57 open tickets at this time.
  • 18. Machine Learning Applications • We expect open-tamil project to grow in additional relevance with surge in Machine Learning (ML) applications and requirement for generating features from large data sets [to train the ML models] • In this paper we show classification of Tamil words between words that are natively Tamil or just a direct English transliterations into Tamil, using the features generated based on open-tamil library. We found a 95% accuracy and 95% recall in testing dataset for models using open-tamil and SciKit learn on Python environment
  • 19. Classifier output and feature vector