![]() |
2 years ago | |
---|---|---|
test | 8 years ago | |
.gitignore | 8 years ago | |
LICENCE | 8 years ago | |
LICENCE_icu | 8 years ago | |
README.md | 2 years ago | |
c_bridge.c | 3 years ago | |
c_bridge.h | 8 years ago | |
convert.go | 3 years ago | |
detect.go | 3 years ago | |
icu_test.go | 8 years ago |
Cgo binding for icu4c C library detection and conversion functions. Guaranteed compatibility with version 50.1.
Installation consists of several simple steps. They may be a bit different on your target system (e.g. require more permissions) so adapt them to the parameters of your system.
Make sure you have build-essential installed. Otherwise icu would fail on the configuration stage.
Installation example using apt-get (Ubuntu):
sudo apt-get install build-essential
Make sure you have pkg-config installed.
Installation example using apt-get (Ubuntu):
sudo apt-get install pkg-config
Download and unarchive original icu4c archive from icu download section.
Example (for version 62.1):
wget http://download.icu-project.org/files/icu4c/62.1/icu4c-62_1-src.zip
tar -zxvf icu4c-62_1-src.zip
mv -i ./icu ~/where-you-store-libs
From the directory, where you unarchived icu4c, run:
cd source
./configure
make
sudo make install
sudo ldconfig
go get gitlab.com/lupine/icu
go test gitlab.com/lupine/icu (must PASS)
Make sure that you have your local library paths set correctly and that installation was successful. Otherwise, go build or go test may fail.
icu4c is installed in your local library directory (e.g. /usr/local/lib) and puts its libraries there. This path should be registered in your system (using ldconfig or exporting LD_LIBRARY_PATH, etc.) or the linker would fail.
icu4c installs its header files to local include folders (e.g. /usr/local/include/unicode) so there is no need to have additional .h files with this package, but the system must be properly set up to detect .h files in those directories.
Note: check icu documentation for returned encoding identifiers.
// Create detector
detector, err := NewCharsetDetector()
if err != nil {
//... Handle error ...
}
defer detector.Close()
// Guess encoding
encMatches, err := detector.GuessCharset(encodedText)
if err != nil {
//... Handle error ...
}
// Get charset with max confidence (goes first)
maxenc := encMatches[0].Charset
// Use maxenc.
// ...
...
// Create converter
converter := NewCharsetConverter(DefaultMaxTextSize)
// Convert to utf-8
converted, err := converter.ConvertToUtf8(encodedText, maxenc)
if nil != err {
//... Handle error ...
}
For more information on icu refer to the original website, which contains links on theory and other details.
ICU is released under a nonrestrictive open source license that is suitable for use with both commercial software and with other open source or free software.
The lupine/icu
binding is released under the BSD Licence
lupine/icu
is almost entirely based on goodsign/icu
,
but contains a few changes relative to that fine project.