Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Linux Software

Linux-Based Instant Messaging for non-English Users? 31

An anonymous reader asks: "Like many others these past few weeks, I took the time to download the latest RedHat (replace with your favorite distro here) and upgrade my system. Despite the usual mail hangovers (corporate mail is still Outlook through POP3, etc.), the new *Office suites are great and I can almost dump Windows. However I was amazed at the sorry state of Linux instant messaging. Before you flame me, mod me down or doom me to a lifetime of Windows usage, allow me to explain: I am not a native English speaker, and it seems that every single Windows IM client I use (except Trillian) can deal properly with accented characters. Worse, every third-party Linux client I have tried deals with them differently, resulting in garbled (vaguely Unicode-ish) junk! Does the Slashdot crowd (especially the non-English folk) have a solution for this? (Short of VMware and Win32 clients, that is. Wine doesn't work at all for me)"

"Portuguese (Continental) is my native language, and I speak French, Spanish, some German and (after spending quite a few months in Poland on a project) passable Polish. I speak those languages practically every other day, and besides e-mail, I have taken to using MSN and Yahoo to discuss work with my clients and colleagues.

I have thus far tried GAIM (both the RedHat 8.0 bundled and the CVS versions), Kopete, Everybuddy, the native Yahoo! (which crashes more often than not and does not even deserve the 'beta' moniker), and none of them are suitable. IRC does the job, sure, but most of the people I have to reach can't use it (firewalls, usually) and won't install another IM client, so the solution has got to be at my end."

This discussion has been archived. No new comments can be posted.

Linux-Based Instant Messaging for non-English Users?

Comments Filter:
  • by lightspawn ( 155347 ) on Thursday October 17, 2002 @06:43AM (#4467870) Homepage
    The problem is lack of standardization. Windows clients assume (I'm guessing, based on my experience) an 8-bit character set; some values in the higher 128 just happen to be mapped to different characters in different locales, so if the locales match there's no problem but if it doesn't you get garbage.

    Ideally, every text would either be unicode or declare the encoding, but of course that's not going to happen. So we want an IM client that knows about different encodings and can be told that a certain person sends and receives messages in a particular encoding (and probably a way to specify a relation between encodings, keyboard bindings, and fonts - or is that the responsibility of the operating system or window manager or KDE or whatever the architecture is).

    As far as I know, no such client exists.
    • by Anonymous Coward on Thursday October 17, 2002 @07:06AM (#4467928)
      No, Windows does not use an 8-bit character set. It's far more complex than that, and I (who happen to have mostly the same problem) personally blame it on the laziness of English-native programmers.

      Trillian, for one, does not support accented characters properly either, and it's a Win32 app. If it handled ISO-8859-1 encoding properly, it might just about work.

      Gaim, AFAIK, suffers from the same malaise, and I could never get the Linux Yahoo client to run twice properly (I have to keep removing the ~/.ymessenger dir)

      Fire (http://www.epicware.com, for the Mac) allows you to select different character encodings, and I have nearly perfect interaction with Win 32 clients once I figure out their settings (except for Yahoo, who are notoriously finicky and keep changing their protocol)

      And as for the "Just speak English" replies... Well... That's an attitude problem I certainly am glad half the world doesn't have. :)
      • by RobFlynn ( 127703 ) on Thursday October 17, 2002 @04:30PM (#4472968)
        Receiving international text works properly in Gaim in the CVS version. Support for sending internationalized text is currently being added.
      • No, Windows does not use an 8-bit character set. It's far more complex than that, and I (who happen to have mostly the same problem) personally blame it on the laziness of English-native programmers.

        Effectively, yes. Windows does use an 8-bit character set for average consumers. If you code your apps to use 16-bit Unicode characters, they will only run on an NT-based version, and not at all on Win95/98/98se/ME... (Remember, there is not really a CreateWindow function in win32, just a macro that will resolve to either CreateWindowA or CreateWindowW )

        One of the problems is the evil TCHAR that MS and MFC pushes on unknowing developers. They themselves do not use either, but instead had their own proprietary means. Recently, however, they did go and publish these libraries [microsoft.com] as a standard extension... (IIRC, this could be one of those "hidden API" things that the Office team created out of need)

        The main problem usually comes in with the mix of controls, protocols, and approaches that people take.

        (However, programmer laziness is a lot of the cause of the problems you encounter)

  • by DeadSea ( 69598 ) on Thursday October 17, 2002 @06:56AM (#4467906) Homepage Journal
    I regularly use gaim and GnomeICU. They are AIM and ICQ clones respectivly. From my experience, they are just as good as the Windows clients. Better in some respects: they take much less memory, don't have ads, and have applets that run in my task bar.

    I'm an English speaker, but I just opened a window in each and sent a friend a message copied and pasted from a Spanish website. The messages went through just fine with accented characters and all.

    • Everybuddy should work fine as well and if you are having language issues just give the mailing list a shout at everybuddy@lists.spine.cx

      Take care - RL
    • by lightspawn ( 155347 ) on Thursday October 17, 2002 @07:16AM (#4467954) Homepage
      I just opened a window in each and sent a friend a message copied and pasted from a Spanish website. The messages went through just fine with accented characters and all.

      Try it with, say, Hindi or Hebrew or Thai - a language with completely different characters, not just accents.

      • by DeadSea ( 69598 ) on Thursday October 17, 2002 @07:29AM (#4467987) Homepage Journal
        Ok, I tried it with several languages:
        • Arabic - Lots of question marks.
        • Chinese Simplified - Lots of question marks.
        • Chinese Traditional - Lots of question marks.
        • German - Works!
        • French - Works!
        • Hebrew - Lots of question marks.
        • Japanese - Lots of question marks.
        • Polish - Mostly works, some question marks.
        • Spanish - Works!
        • Italian - Works!
        Based on these results, GnomeICU and Gaim use the ISO 8859-1 Latin 1 character set. The European languages that use these character sets work fine. Languages such as Polish that use ISO 8859-2 Latin 2 only mostly work, and languages that use other character sets really don't at all.

        I found no way to change your character set in either of these programs, although they may use your system's default character set.

        Mozilla has no problem displaying any glyphs from these character sets on my computer. It would be nice if other Linux programs got to that point too. (I tested this by copying text from the World categories of the open directory project [dmoz.org].)

      • gaim cvs together with a patch of mine (grab on the gaim-devel mailing list allow to use almost any language. I've tested Cyrillic, ethiopian, japonese, French with only succes so far. So please be a little patient, and it will come soon.
  • by DeadSea ( 69598 ) on Thursday October 17, 2002 @07:56AM (#4468062) Homepage Journal
    This seems like an apropriate time to take a little break and discuss how a programmer would design an application to handle whatever language is thrown at it. I'm primarily a Java programmer, and I like the way that Java handles internationalization. When you program in Java, you just expect it to work. The Java APIs have been designed around internationalization from the start.

    There are three levels: bytes, character sets, and glyphs. Your program recieves a stream of bytes and you have to display those bytes to the user as text in the correct langugage. That display is done using glyphs (font characters). A character set maps bytes to glyphs.

    There are tens if not hundreds of different character sets. A character set might map each byte to a different glyph (latin 1 and 2), only some bytes to glyphs (ASCII), multiple bytes to a glyph (Unicode), or varying numbers of bytes to glyphs and some bytes to no glyph (UTF-8).

    Java APIs handle this by reprensenting all Strings internally in Unicode. Unicode is the granddaddy of all character sets. Almost every glyph has a value in Unicode. When you get a stream of bytes in java you can use a Reader to translate that stream into Unicode. The Reader is constructed with the name of a character set. If no character set is specified, the system's default is used. The character to be used usually comes from meta-data. In html for example, the character set for the page is transmitted by the server in the data that comes before the page itself is delivered (the http header.)

    Once you have a unicode string it is straighforward to find a glyph to display for each character. This all depends on the right fonts being installed, but usually APIs handle it for you.

    I18n problems usually occur when a programmer doesn't know to how to translate bytes into unicode characters. The programmer may always use the default character set, ignoring any meta-data. Similarly on sending data, the programmer must tell the other end with which character set the data is sent. Other problems may occur when a needed font is not installed.

    Often, a system works with a specific character set that doesn't support all characters (such as latin 1). When more characters are desired in such an instance, escape characters are often used. There are \uXXXX style escape sequences in source code, and &XXX; escape sequences in html. Such escape sequences may be able to retrofit an older system in which a specific non-inclusive character set is assumed.

  • AMSN (Score:4, Informative)

    by awx ( 169546 ) on Thursday October 17, 2002 @09:01AM (#4468406)
    Look at amsn. [sourceforge.net]. Large list of languages supported in the menu systems etc, although not having tried it apart from in English mode I don't really help you that much. Worth a shot tho, it's the best MSN client out there...
  • Licq (Score:3, Informative)

    by Papineau ( 527159 ) on Thursday October 17, 2002 @09:57AM (#4468788) Homepage

    My experience with Licq is that it works perfectly in ISO 8519-1 (I use mainly French). In the options you can select other languages as well (UTF-8, ISO 8519-2, ISO 8519-6, CP 1256, KOI8-R, JIS7, etc.), but I never used them. Just make sure the proper fonts are installed, and you should be good to go.

    The only problem I see is that you don't mention ICQ in your possible choices...

  • ...(or Latin-1 using) programmers for not developing fully i18nized code, perhaps developers who are users of these languages should be contributing to the development of projects that don't handle them well (or at all) to help solve the problem.

  • Licq (Score:2, Informative)

    by Skinny Rav ( 181822 )
    Licq does pretty good job both with displaying different languages and localisation (at least it is quite well translated to Polish). The only problem there is that you can choose encoding on program-wide basis, so it will be difficult to chat with different people in their native languages.

    So I would recommend it for western languages and gg or ekg (two excelent gadu-gadu clients) for chats with your Polish friends

    Raf
  • Well as a KDE developer and as someone with the last name of Meyer I frequently get e-mail in German even though I do not speak it. Because of this I wrote a plugin for Kinkatta that allows me to talk to other people in different languages. I write in english, It translates it to german and send the message. A message comes in and it is translated from german to english and passed in. Although bablefish doesn't do that great for large articles snort sentences back and forth work fine. Many people I know in turn have then used Kinkatta for the same purpose and I myself have fixed several language errors.
  • State of Linux IM (Score:4, Insightful)

    by 0x0d0a ( 568518 ) on Thursday October 17, 2002 @12:02PM (#4469992) Journal
    The state of Linux IM *is* unfortunately pretty awful. The only consolation is that Windows IM isn't all that much better.

    Let's take a look.

    Pros: no Linux IM client that I know of has ads or relies extensively on client-side security (*cough* ICQ).

    Cons:

    * There is no Linux ICQ client that can consistently transfer files to Windows users. GnomeICU doesn't support it with any recent ICQ version. Licq requires you to convince the other person to manually flip a protocol switch and only sometimes works then. Licq's file transfer does not work with Trillian (which is growing in popularity). There's a host of IM dead projects, all of which only partly work.

    * Protocol lag. I don't know of anything that can *fully* speak the new ICQ protocols.

    * Periodically doesn't work. Not sure how bad this is on GAIM these days, but it used to be awful as AOL and MS duked it out. Sudden, unexpected protocol changes are the name of the game.

    * Only reasonable ICQ front end requires Qt: Licq is the most usable Linux ICQ client I know of, and the best gtk front end *sucks*, with only one feature over the qt version (auto-establishment of secure connections). It's unstable, doesn't have middle-button-opening of message windows, doesn't highlight usernames in the username clist...

    * Gabber *could* be an answer, but isn't. There's a really nice, secure Jabber client called Gabber. Unfortunately, no one *uses* Jabber, and the gateways to other protocols for it are flaky as hell.

    * talk is dead. It was the most reliable system I've ever used to talk to people on any computer system, VMS or UNIX.

    Meanwhile, IM continues to be a crucial part of the desktop, even entering the business world. I'd like to see Red Hat on some of the groups shaping this stuff (as Microsoft and AOL are). I'd *hate* the standard protocol to be unencrypted or have a message size limit, or rely on propriatary Windows libraries.
    • Re:State of Linux IM (Score:3, Informative)

      by RobFlynn ( 127703 )
      * OSCAR, IRC, and JABBER file transfer support is working int he CVS version of Gaim (last I checked). We will be continuing development to add transfer support to the rest of the protocols (ICQ included)

      * Re: periodically not working: We have not had any problems with oscar for nearly a year. Any problem has been immediately resolved.

    • Re:State of Linux IM (Score:1, Interesting)

      by Anonymous Coward
      HEre's an idea, instead of complaining "nobody uses Jabber", you could actually USE it. The new AIM bridge isn't bad at all, but it is of course better if you can convince people to use Jabber, than just use it as a bridge. It is certainly better in that it does everything natively via unicode. What is listed here (as far as protocol changes) isn't about Windows vs. Linux clients, it's about "Official" clients vs. "Unofficial" clients. Of course the 3rd party clients can't easily and fully support IM Protocols that change every week - but we shouldn't expect that they could. That being said, GAIM works fine over AIM to recieve and send Japanese text last time I checked. (Both systems had Japanese as the default Locale, one was redhat 8 + gaim, one was windows + gaim).

      -- dr.Noah
  • by Ethan ( 9204 ) on Thursday October 17, 2002 @01:09PM (#4470606) Homepage


    Oh ho HO! International chat certainly is a problem in the Linux community, owing to many factors; not the least of which being that the developers of the major IM forces out there seem to largely be from ISO-8859-1 locales. Thus ISO-8859-1 works pretty well, and other, more ASCII-deviant (CJK) locales work with virtually no success.


    The good news is that an answer is here! I've been on a crusade to make Gaim [sourceforge.net] the penguin-pimpin'est international chat machine available, and it's really paying off! For the stable series of gaim (0.59.x, currently at 0.59.4 with 0.59.5 to be released possibly as I type this) (I just looked and 0.59.5 is out), if your locale is set correctly you should be able to chat in whatever language your little heart desires... (I have personally successfully used it with English and Japanese) as long as you aren't chatting over the AOL Instant Messenger service or the ICQ service, both of which use the Oscar protocol. However, MSN and Jabber, for instance, should be substantially correct.


    The fabulous news is that the development version of gaim coming at us right now has first class i18n support on the whole gamut of protocols! With a timely move to Gtk+ 2.0 [gtk.org] and the Pango text formatting system, Gaim now has international text formatting second to none in the OSS community and hardly rivalled in the commercial world. Images like this shot of gaim displaying Japanese, Russian, and English simultaneously [sourceforge.net] display what I'm talking about very nicely. Not only can we do non-English text, thanks to UTF-8 we can do all of the modern languages of the world simultaneously. In addition, support for internationalization on the troublesome Oscar (AIM and ICQ) protocol has been added and is coming along very nicely.


    In short, look for the next major release of gaim to clear up these issues in a big way. For those hardy souls wanting to test the code that's currently in CVS, please note that it is NOT currently complete, and isses that you have are most likely transient.


    Also, please be aware that your locale MUST be set correctly for internationalized programs to work the way you expect. Programs that only deal with your system can be more forgiving, but programs that communicate over the network absolutely must know more about your locale, including your character set. If the output of your 'locale' command lists LC_CTYPE as, for instance, "C", it's no wonder i18n isn't working! Set your LANG or LC_CTYPE correctly for your language (en_US for English with ISO-8859-1, es_ES or es_MX for Spanish, pt_PT or pt_BR for Portuguese, ja_JP for Japanese, zh_CN for Chinese, etc.) and you might see general i18n support improving dramatically.

  • found at psi.sourceforge.net [sourceforge.net] of course its a Jabber client, but as long as the jabber server you choose aint blocked it'll have a gateway to MSN, ICQ and AIM (unless ofcourse the admin dont want to provide a gateway, you can ofcourse run ur own server)

    Psi features the following:
    * Message (ICQ-style) and Chat (AIM-style) modes
    * Drag and drop to send to multiple contacts
    * Full Unicode support
    * Secure connections
    * Saving contact list locally, and server sync on login
    * Icon Themes
    * Agent registration and searching
    * Retrieving and updating User Info
    * Sound support for incoming events
    * Auto-away after a configurable amount of time
    * Tray/dock icon for KDE/GNOME environments
    * Language plugins

    It also available for MS Windows, Mac OS X as well as X (Linux)
  • Try pebrot (Score:1, Informative)

    by Anonymous Coward
    It's a console MSN client and AFAIK it works perfectly in spanish (haven't tried on other langs, but iso8859-1 ones should work) Give it a try http://pebrot..sourceforge.net [sourceforge.net]
  • As per the subject -- the Jabber [jabber.org] protocol has inherent UTF-8 support. You'll have to try the clients yourself to see which ones take advantage of it properly, but the protocol itself is most certainly non-language-specific.

Living on Earth may be expensive, but it includes an annual free trip around the Sun.

Working...