Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
×
Communications Software

A Spell-Checker for Scientific Terms? 48

deaflamb wonders: "I'm a biology major and have been writing a paper for a class. I'm using Microsoft Word on my mac. It's annoying me how often I have to click 'ignore' or 'add' on the spell checker when it comes across words only used in Science. Was wondering if there where any free scientific spell checkers out there that can be added into Word or OpenOffice (since I use that too), and how well they work?" It didn't take me long to find these guys, who look like they cover a significant portion of the terms used in the medical and science world, however, their price for a single user license for only one of their specialty packs can run into the hundreds of dollars. Might there be other options that are a bit more affordable or, as the deaflamb asks, free?
This discussion has been archived. No new comments can be posted.

A Spell-Checker for Scientific Terms?

Comments Filter:
  • Not as far as I know (Score:3, Interesting)

    by poopdeville ( 841677 ) on Monday December 05, 2005 @10:51PM (#14190713)
    But this is a great idea for a collaborative, freely available project. I'm a mathematician by trade and have run into a similar problem. Want to work together?
    • by pbhj ( 607776 ) on Tuesday December 06, 2005 @07:42PM (#14198257) Homepage Journal
      1, How about you have a website where people upload their private dictionaries (with a language option too??). This shouldn't be too hard using the OOo files as they are presumably XML, might be quite server intensive though. Strip the words and tabulate them (add them to a (pgsql?) database) - you can throw any away that you have enough of or that have already been rejected as misspellings (sp?!?).

      If a word appears with the same spelling in 100 (or downloaders preference) dictionaries then it is tagged for inclusion in the master dictionary.

      Uploaders could specify the general area they write in as well as the language, eg Physical Sciences, Literature, Agriculture, ... so a dictionary request could be limited by subject field too.

      Require a dictionary upload _OR_ payment of a fee to avoid freeloaders.

      2, ...
      3, Profit ??!

      [PS: I just looked and OOo uses .aff or .dic formats.]
    • I'd definitely be interested in helping out with this. Whilst not in a scientific field as such, the financial/IT field I'm in has a lot of words not usually found in a concise dictionary. What kind of format and what kind of distribution method would be best?
      • You and anyone else who is interested should send me an e-mail. As it stand, I'm thinking that if a database of words is compiled and categorized, we can generate several different dictionary formats easily. I'm busy working on another project right now, but I'll start writing code for this in a few days.
    • by Intron ( 870560 )
      Wouldn't the simple way to do this be to take the indexes of a few of publications in a given field and process them into a list? You aren't looking for definitions or anything, just correctly spelled words.
  • Try this: (Score:5, Informative)

    by Gothic_Walrus ( 692125 ) on Monday December 05, 2005 @10:53PM (#14190719) Journal
    This page [comcast.net] has a few dictionaries up for free. I don't know if they've got quite what you're looking for, but it's worth a shot.

    Beyond that...the textbook is always a good choice. Type it, check it a few times, and then add it to the dictionary. :)

    • Scanning it and using OCR to read the words seems like a more sensible thing to do, then post-processing it by sorting out only unique words. Of course, a backside with this solution is that there WILL be some manual tidying-up to do...
      • by pomo monster ( 873962 ) on Monday December 05, 2005 @11:22PM (#14190827)
        "Downside." Trust me. You mean "downside."
        • I'm writing in Swenglish [wikipedia.org], you insensitive clod! ;-) (no hard feelings, just had to explain the error...)
          • It was a good error--"backside" means "arse." :-)

            Now lemme see if I can butcher this phrase I learnt: "Du ar så amskralig att jag kissar i mina kalsonger nar jag ser det."
            • "You are so (amskralig) that i pee in my underwear (nar) I see it".

              The dots that sometimes are above "a" are important in Swedish! (to be really exact it isn't dots above a, ä and a are different letters in Swedish. Mixing them up is like mixing up u and o in English, or something...

              är = are (to be)
              ar = are (100 m^2)

              nar = not a word in Swedish
              när = when

              amskralig = not a word in Swedish, you possibly meant "anskrämlig", but that isn't really a common word...

    • Some of those will actually work very well for the latin names involved with organisms. I tried rechecking with one of the dictionaries that related to my paper, but lots of the science specific terms that are not lating names still came up with erros. Thanks though. These will help a lot. Ray
  • Same boat... (Score:3, Informative)

    by xiao_haozi ( 668360 ) on Monday December 05, 2005 @11:39PM (#14190903) Homepage Journal
    I am finishing up a BS in Biochemistry and Molecular Biology and have battled this throughout my collegiate years as well. I have searched long and far for a solution and have thus far not found anything. I have come across a few medical versions but even they tend to be for the lay person. One solution (somewhat) has been http://wikipedia.com/ [wikipedia.com]. I know this is not a dictionary but works if you need to double check a spelling, but mainly I have found it useful while writing scientific pieces to double check a few pathways or cell types. While its not comprehensive by any means, it is coming along at a suprisingly great rate. On a quick note... The Cell (which can be found at NCBI website) is a good book reference for such purposes as is Voet and Voet's Biochemistry.
    • One solution (somewhat) has been http://wikipedia.com/ [wikipedia.com]. I know this is not a dictionary but works if you need to double check a spelling, but mainly I have found it useful while writing scientific pieces to double check a few pathways or cell types.

      AbiWord [abisource.com] is a capabale F/OSS word processor which is available for most platrforms. It is lighter than OO.o Writer (though that also means it lacks SOME of OO.o's features). One of the really nice features is that it supports a number of plugins. There are pl

  • by bluelip ( 123578 )
    The last time I played w/ spellcheckers, the 'soundex' function was tops. It basically mapped phonetic sounds to values and summed them.

    The problem w/ scientific terms is that the rules and patterns that compose a soundex value don't hold up wo complex words.

    The same approach may possibly be taken, but the patterns and values will need to refined/redefined.

    • wikipedia has a better description of the soundex algorithm here:

      http://en.wikipedia.org/wiki/Soundex [wikipedia.org]

      Soundex is a phonetic algorithm for indexing names by their sound when pronounced in English. The basic aim is for names with the same pronunciation to be encoded to the same string so that matching can occur despite minor differences in spelling. Soundex is the most widely known of all phonetic algorithms and is often used (incorrectly) as a synonym for "phonetic algorithm".
    • Soundex really sucks. Don't use it. The whole "first letter stuck on the front" negates much of its value. Korn will never match Corn.

      Metaphone is better.
  • Simple solution? (Score:1, Interesting)

    by sithkhan ( 536425 )
    Why not go to Wikipedia or MIT.edu or some other website that that has text with the subject matter you are studying and simple copy and paste, and then add to dictionary? Or look on Gutenberg.org and copy and paste those. It worked for me when I was writing a paper on The Oddessey and had to use those uncommon Greek spellings. YMMV.
  • This sounds like a good case for a site with user contributions added to genre specific dictionaries - eg: Biology, medical, IT - pretty much any field with a very specific syntax could benefit...
  • Just suck it up... (Score:5, Insightful)

    by Otter ( 3800 ) on Tuesday December 06, 2005 @12:14AM (#14191046) Journal
    Realistically, you only use a tiny subset of scientific vocabulary. OK, as an undergrad you face more breadth than a researcher does, but still.

    Just suck it up. Add words to your dictionary as you go, and within a month you'll rarely see those squiggly red lines. For some reason, people are too intimidated to just start into it.

  • by Anonymous Coward
    Those words are irreducibly complex, clearly indicating that a concious mind, or intelligent designer created them (yes, a human "actually" created them, but since this human was created by an intelligent designer, it comes out the same) . By using a spell checker, you are indicating that, somehow, YOU are able to ascertain the correct spelling, when only the intelligent designer could possibly know this (see my study of pirate population vs. climate warming for more proof). Therefore, simply asking about t
  • Pubmed (Score:4, Insightful)

    by virology-not for com ( 841426 ) on Tuesday December 06, 2005 @01:03AM (#14191243)
    I had this idea a while ago. Probably every science major does. Anyways, dictionary files are simple, just the word followed by an endline. So all you need is a good database. A good one for biology is pubmed over at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=P ubMed&itool=toolbar [nih.gov] They offer a way to download all their abstracts. Most are spell checked, and they should, in culmination, include most every biological word. So, download their abstracts (i think they are in xml) parse them (delete duplicates and words in a normal dictionary... maybe words with numbers) and put them into a txt file followed by an endline. Done.
  • Google Scholar (Score:2, Informative)

    by phyy-nx ( 544808 )
    I know this isn't an autmated solution, but whenever I need to know how to spell a scientific word, I use Google Scholar [google.com]. I take a guess at the spelling, search for it, and google will often prompt me with the correct spelling. If I get thousands of hits but all happen to be wrong then hey, at least I'm spelling the word the same way thousands of others have :)
  • by Lifix ( 791281 ) on Tuesday December 06, 2005 @03:33AM (#14191778) Homepage
    I can't tell you how few people remember to backup their dictionary when they backup a computer. I worked as an intern in the IT department at my school and tech savy faculty members would regularly loose 3+ years of work on a custom dictionary because they failed to back it up. I suggest you just struggle through, and add all the words to the dictionary, but remember to keep a copy somewhere.

    The ability to customize your dictionary is something that most people in the tech world don't talk about much, but I use it every day of my life: by day I work at a computer retail store, but at night and on the weekends I'm a writer and a custom dictionary keeps me from screwing up proper nouns that I am using.
  • At one time you could get specialty dictionaries for Nisus from the Nisus company. I wrote my thesis with Nisus, and then imported the text without formating into Word 5 for the Mac. Had to do that as I submitted my thesis via e-mail attachment to my thesis director who had a M$ Windows box.

    Apple are you listening? The spell-checking feature built into Mac OS X should have a file or something of that nature that could be imported into the system dictionary file/folder.

    Indeed, and interesting project for us
  • Why not maintain a site-wide dictionary? Of course, if you want to save your
    own time only, you can only buy one. Actually, this is an area where users
    may help openoffice to become more successful. Create and maintain a scientific
    terms spelling database. Make it an optional extension for the spell checker.
  • you could just take some scientific texts from your profession that are available electronically and that are already proofread.

    tokenize them, sort the words, filter out duplicates (maybe only keep words that appear more than once), and voila, you have a spelling dictionary.

    Not sure about copyright but I think it should not be a problem for this use.
  • Medical Dictionary (Score:2, Informative)

    by sgent ( 874402 )
    I've used medical dictionaries in the past. Stedmens is probably the most well known -- and make their dictionaries available in digital form for import into Word, etc. They also have legal and some other terms. http://www.stedmans.com/category.cfm/210 [stedmans.com]
  • Mathematicians and mathematical scientists (is there some other kind of scientist?) use LaTeX, which has a spell checker for technical terms. If you type your LaTeX wrong, you get this:

    error: ! Undefined control sequence

    Then you know to fix your spelling, and the line and column number.
    You can also run LaTeX through a normal spell checker, and it will ignore the technical terms. aspell does this automatically, but ispell needs the -t option. LaTeX is a pain to learn, but it is worth it.
    Unfortun

  • A Spell-Checker for Scientific Terms?

    Hey why stop there? It would be great if someone developed a Spell-Checker for Hip-Hop terminology. Because I have been having so much trouble when I write my rap songs. I can't figure out where to put the "izzles" and "eezies".

  • Plants' Latin names (Score:2, Informative)

    by skinfaxi ( 212627 )
    I've been working on a web project about vegetable seed identification and did some searching for a very similar thing (a dictionary I could import into MS Word that had the Latin names of plants). I didn't find anything appropriate - it's particularly sticky because some folks use different names to mean the same plant, and some use the same name to refer to different plants! As in many scientific fields, there are efforts to standardize the nomenclature so everyone is talking about the same thing using th
  • The "large" ispell dictionary has a lot of biological and medical terms. It might serve as a good starting point.
  • I work part-time as a freelance copyeditor for medical and scientific publishing. I use Dorland's Medical Dictionary as my reference for medical/scientific terminology. Of course the dictionary is geared towards medical use, so if the scientific terms you need are, for example, from botany or astrophysics, this probably would not be of use. Dorland's does, however, come with a CD-ROM which includes a dictionary/spellchecker for use with with MS-WORD, so I find it immeasurably valuable.

The Tao is like a glob pattern: used but never used up. It is like the extern void: filled with infinite possibilities.

Working...