SBBIC Khmer Word Breaker Using ICU

We have been working to get a Khmer word breaker into ICU so that Khmer Unicode text can break between words automatically, and the newest release of ICU now includes one.  Unless you are a programmer, though, it is hard to get at.  So we have made a small program that uses ICU and lets you run the Khmer word breaker on Linux (a Windows version will come soon).  We have only tested it on Ubuntu 11.x, and there is still room for improvement, so please try it and let us know about any problems you find.

The word-breaker is currently dictionary based, so it will work best on documents that have correct spelling.  In the future we hope to add additional programming that will better deal with “unknown” words.
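To show why a dictionary-based breaker depends on correct spelling, here is a minimal sketch of greedy longest-match segmentation. It is not ICU's actual algorithm, which is more elaborate. The function name `segment` and the three-word toy dictionary are assumptions made only for illustration.

```python
def segment(text, dictionary):
    """Split text into dictionary words, longest match first.

    Characters that match no dictionary entry are collected into a
    single "unknown" token, which is how a misspelled word tends to
    disturb the breaks around it.
    """
    max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    unknown = ""
    i = 0
    while i < len(text):
        match = None
        # Try the longest candidate first, then shorter ones.
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if candidate in dictionary:
                match = candidate
                break
        if match is None:
            unknown += text[i]
            i += 1
            continue
        if unknown:
            tokens.append(unknown)
            unknown = ""
        tokens.append(match)
        i += len(match)
    if unknown:
        tokens.append(unknown)
    return tokens


# Toy dictionary (hypothetical, far smaller than any real word list).
toy_dictionary = {"ភាសា", "ខ្មែរ", "ហើយ"}
print(" | ".join(segment("ភាសាខ្មែរ", toy_dictionary)))
```

With a real word list the same idea applies, but any stretch of text that is not in the list comes out as one unbroken piece, which is the "unknown" word problem mentioned above.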

To use the program in Ubuntu, place the Unicode .txt file you want to break in the same directory as sbbic-khmer-breaker.out.  Then open a console in that directory and type:

./sbbic-khmer-breaker.out yourinputfile.txt youroutputfile.txt

replacing the two file names with the names of your own input and output files.
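If you have many files to break, the command above can be repeated from a small script. The following is only a sketch: the helper names and the "-broken" output suffix are our own assumptions, and the only thing taken from the program itself is the "input file, then output file" argument order described above.

```python
import subprocess
from pathlib import Path

BREAKER = "./sbbic-khmer-breaker.out"  # program name as given in this post


def breaker_commands(filenames, suffix="-broken"):
    """Build one command per .txt file name.

    The output naming (input name plus suffix) is an assumption made
    for this example, not something the breaker requires.
    """
    commands = []
    for name in sorted(filenames):
        path = Path(name)
        if path.suffix != ".txt" or path.stem.endswith(suffix):
            continue  # skip non-text files and outputs of an earlier run
        commands.append([BREAKER, path.name, path.stem + suffix + ".txt"])
    return commands


def run_all(directory):
    """Run the breaker on every .txt file in the directory that holds it."""
    names = [p.name for p in Path(directory).iterdir() if p.is_file()]
    for command in breaker_commands(names):
        # cwd is the breaker's own directory, matching the instructions above.
        subprocess.run(command, cwd=directory, check=True)
```

Calling run_all(".") from the directory that contains sbbic-khmer-breaker.out would process each text file in turn and stop at the first failure.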

Again, if you have any issues, please don’t hesitate to ask in the comments.

DOWNLOAD: SBBIC Khmer Word Breaker Using ICU (36)

Latest Khmer Grammar Checker for OpenOffice and LibreOffice

This latest release can check that all quotes and brackets have been closed, and it adds some additional word-coherency checks, which make sure you followed the same style of spelling throughout your document (our list adheres to Chuan Nath’s spelling whenever possible).
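For readers curious how a check for unclosed quotes and brackets can work, here is a minimal stack-based sketch. It is not the code of the extension itself. The function name `unclosed_marks` and the set of mark pairs are assumptions chosen for illustration.

```python
# Opening mark -> closing mark. This set of pairs is an assumption.
PAIRS = {"(": ")", "[": "]", "{": "}", "\u201c": "\u201d", "\u00ab": "\u00bb"}
CLOSERS = set(PAIRS.values())


def unclosed_marks(text):
    """Return (position, character) for every mark without a partner."""
    stack = []      # opening marks still waiting for their closer
    problems = []   # closing marks that arrived with nothing to close
    for pos, ch in enumerate(text):
        if ch in PAIRS:
            stack.append((pos, ch))
        elif ch in CLOSERS:
            if stack and PAIRS[stack[-1][1]] == ch:
                stack.pop()
            else:
                problems.append((pos, ch))
    # Anything left on the stack was opened but never closed.
    problems.extend(stack)
    return sorted(problems)


print(unclosed_marks("(balanced) text"))  # prints []
```

Straight quotes (") are left out on purpose: the same character opens and closes, so they would need a different rule, such as counting occurrences per paragraph.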

This extension can be used with OpenOffice and LibreOffice, which are both free, open-source word processors.

Please let us know in the comments if you have any trouble, or would like any additions to the grammar checker.

DOWNLOAD: SBBIC Khmer Grammar Checker (13649)

Tatoeba: Online Sentence Collaborative Dictionary with Khmer

We just came across a site called Tatoeba, a community building an online dictionary of sentences.  Khmer has not been officially added, but it is in the works.  You can already add your own sentences and translations and then add them to the public Khmer list here: http://tatoeba.org/eng/sentences_lists/show/765

Visit: Tatoeba

Read the Christian Khmer Bible Online

The two versions of the Khmer Bible can now be viewed online for free in Unicode!

Special thanks to Maurice Bauhahn and the Bible Society in Cambodia!

Try the Latest Grammar Checker for OpenOffice

We are releasing ALPHA 3 of the Khmer grammar checker, which is built with LanguageTool.  Please try it and tell us what you think.  The newest feature of this program is the ability to keep the spelling of every word consistent throughout a whole document.  In most cases we follow the spelling in the dictionary of Samdech Sangh Chuan Nath.
Please download the program and install it as an extension in OpenOffice 3.x.

DOWNLOAD: SBBIC Khmer Grammar Checker (13649)

Please Excuse Our Mess

The multilingual portion of our site is in the process of being updated. Please excuse the mess as we update our site. We hope the new features will remove some of the problems our users have been having with the Khmer portion of the site. If you are interested in helping with translation of the site, please let us know in the comments.

Thank you!

Khmer Spelling Checker for Adobe InDesign CS 5.5

UPDATE: Our solution does not yet work perfectly.  Line breaks do not work with a hair space, so we are still working with Adobe to find a solution that has no such issues.

With the release of InDesign CS 5.5, Hunspell dictionaries are now supported.  This means we can use the SBBIC spelling dictionary with InDesign!  There are some issues, because InDesign was not fully tested with Khmer, but we are able to get around them (even though it makes things a bit complex).  Right now our solution is Mac only, because I don’t have my PC here with me, but we will include PC instructions soon (and they won’t be much different from the Mac instructions).

Here are the files you will need:

File 1: Khmer Mac Unicode Keyboard for InDesign by SBBIC (171)

File 2: World Composer Template Files (109)
Thanks to: http://www.thomasphinney.com/2009/01/adobe-world-ready-composer/ for these templates

File 3: SBBIC Khmer Spelling Dictionary 1.4 for InDesign (154)

And here is the video tutorial:

New Release of Khmer Grammar Checker

The latest Alpha release of the SBBIC Khmer Grammar Checker is out for you to download.  The newest addition is the detection and correction of repeated words (e.g. ហើយ​ហើយ).  Download it, install it as an extension in OpenOffice 3.x, and let us know what you think!
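As a rough illustration of repeated-word detection and correction, here is a small sketch. It is not the checker's own rule. It assumes words are separated by ordinary whitespace or by the zero-width space (U+200B) often typed between Khmer words, and the function names are hypothetical.

```python
import re

# Separators between words: whitespace or zero-width space (an assumption).
SEPARATORS = r"[\s\u200b]+"
# A whole word followed one or more times by a separator and the same word.
REPEAT = re.compile(
    r"(?<![^\s\u200b])([^\s\u200b]+)(?:[\s\u200b]+\1)+(?![^\s\u200b])"
)


def find_repeated_words(text):
    """Return (index, word) for each word equal to the word before it."""
    words = [w for w in re.split(SEPARATORS, text) if w]
    return [
        (i, cur)
        for i, (prev, cur) in enumerate(zip(words, words[1:]), start=1)
        if prev == cur
    ]


def collapse_repeated_words(text):
    """Replace each run of an identical word with a single copy."""
    return REPEAT.sub(r"\1", text)


print(collapse_repeated_words("the the end"))  # prints: the end
```

A real rule would also need exceptions for repetitions that are intentional, so treat this as a starting point only.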

DOWNLOAD: SBBIC Khmer Grammar Checker (13649)