Become a fan of Slashdot on Facebook


Forgot your password?
Programming Technology

Tools for Debugging Stack Corruption? 30

blackcoot asks: "I know that there are tools which exist on hardened Linux distros and OpenBSD (and probably $your_os_of_choice too), which are designed to help track down stack corruption (which is often symptomatic of buffer overruns). Unfortunately, that's about all I know about those tools. What options are there? How effective are they? What does it take to get access to those tools? Are they really useful enough to make the effort justified?"
"My goal here is to increase my effectiveness at hunting down memory bugs, not necessarily to produce bullet proof, secure production quality code — the bugs I'm dealing with are, I believe, largely in software delivered by a subcontractor who swears they test their code and can't reproduce my bugs. What I really want is to a) demonstrate to them that a problem does, in fact, exist; b) demonstrate that the problem exists inside their code; and c) give them the tools they need to find, repair, and verify that the bug is no longer an issue.

First prize in my mind would be a Valgrind like tool which only requires trivial changes to the build process, but I'm pretty open to suggestions. If I have to run a hardened Linux to make this all possible, suggestions on pretty leading edge distros with reasonable automagic self configuration and hardware detection + laptop support would be greatly appreciated. Thanks much!"
This discussion has been archived. No new comments can be posted.

Tools for Debugging Stack Corruption?

Comments Filter:
  • Electric Fence (Score:4, Informative)

    by sleepingsquirrel ( 587025 ) <Greg,Buchholz&sleepingsquirrel,org> on Wednesday January 11, 2006 @03:07PM (#14447737) Homepage Journal
    See Electric Fence [] and your documentation for debugging malloc [].
    • Also, you can use the Boehm garbage collector as a leak detector [].
    • CCured (Score:3, Informative)

      Also, something I've never tried, but always been interested in is CCured []...

      CCured is a source-to-source translator for C. It analyzes the C program to determine the smallest number of run-time checks that must be inserted in the program to prevent all memory safety violations. The resulting program is memory safe, meaning that it will stop rather than overrun a buffer or scribble over memory that it shouldn't touch. Many programs can be made memory-safe this way while losing only 10-60% run-time perform

    • Electric Fence can only detect heap corruption, but I think the submitter meant memory corruption of any kind. Valgrind can detect both.
  • A Couple Of Tools (Score:5, Informative)

    by BusDriver ( 34906 ) <> on Wednesday January 11, 2006 @03:11PM (#14447763) Homepage
    You don't have to run a whole new distro!

    First of all, there's libsafe [] which is just a simple compile and install. Seeing as we don't know much about your specific problem, I'm not sure if it'll help.

    Second to try is compiling your own custom kernel with the GrSecurity [] kernel patch. It has as part of it the PaX [] kernel patch which is very effective at protecting against overflows. You could even just install the PaX kernel patch itself, but I believe the version in GrSecurity is kept more up-to-date. You can compile it with protection turned off by default, but using the PaX tools turn on protection for just the binaries you wish to check.

    Installing either (or both) of these could well help you, without the need to blow away your current install and start fresh.

  • by Z00L00K ( 682162 ) on Wednesday January 11, 2006 @03:35PM (#14447976) Homepage
    As stated, Valgrind and Electric Fence are two alternatives. I would also like to point out Purify [] (now owned by IBM) which I have been using from time to time. Of course the catch with that tool is that it's commercial, but you should at least evaluate it.

    Using a complete hardened Linux distro is not necessary for "normal" development work, but it may be a good idea to verify that your application actually works there too.

    In addition to the run-time checkers you can also look for static checkers like Splint []. It can provide you with some extra information about potential problems that only occurs under certain conditions that maybe aren't met during runtime.

    If the effort of trying to track down stack corruption is worth it? - YES! Absolutely! Catching bugs in an early stage is essential to keep down the lifecycle cost of a system. Also consider the risk of badwill if your product is prone to strange behavior and crashes.

  • by kbielefe ( 606566 ) * <> on Wednesday January 11, 2006 @03:53PM (#14448165)
    The replies I have seen so far have all been excellent suggestions for detecting buffer overflows on the heap. Adequate stack protection actually requires the code to be compiled with a compiler that adds extra checks to each function call. This page [] has more information on making gcc do what you want. Gentoo is very easy to set up for it, FYI, but it should be possible on any *nix distro without any kernel changes.

    I feel your pain about bugs in libraries that you must use without the source code. I had an arrangement like that for nearly two years with extremely buggy code. Just relinking the static library with changes to my code, changing where in memory the library would reside, would often cause huge problems. Let's just say I got really good at debugging in assembly with gdb. I got where I could call them up and say something like, "you have some code at the end of function foo that looks like 'a[2] = b', but a was never allocated." They'd always reply with something like, "Yes it is ... oh wait..."

    • i really, /really/ want to avoid the assembly route if at all possible. first, i have a healthy dislike of assembly. but more importanty, analyzing the assembly requires some reverse engineering which has the potential to land me in all sorts of trouble. our collaborators on this particular project are potential competitors on other projects (my company is in a rather small niche), which is why neither of us has the complete code base.

      i'm going to try to arrange to send them a set of object files which can
  • Excuse me! I don't have $ in front of my variables!

    I don't like yourOsOfChoice style naming, so I'll give you that...
  • If you don't mind spending money, get Rational Purify from IBM. They seem to properly support Linux now, and they are the gold standard. You can download a 2 week evaluation license to give it a try.

    As far as your build process, you just take whatever your link command is, and tag "purify" on the beginning of it. The tool will instrument all compiled objects that you are linking in, and produce a binary that does all sorts of really useful checking (read/write of freed memory, buffer overruns, memory leak

    • If you are developing software for money, there is nothing better.

      I've found ccured more consistently finds problems than Purify. Plus it's faster, and free. It's also much harder to set up to work with large projects. So better in some significant ways, and worse in other significant ways. (also it doesn't find leaks but Boehm works fine for that).

      • i suppose that ccured doesn't handle c++. personally, i've got nothing against spending money to get the right tools to solve the problem. whatever gets me from showstopper bugs to running stablely for indefinite amounts of time quickest is the path i'm interested in.
  • Ultimate solution (Score:2, Redundant)

    by Nutria ( 679911 )
    Don't use C!
    • i'm using c++, so i guess that means i'm using c too. it's not difficult to avoid about 99.99% of memory bugs in c++ by judicious use of the stl, avoiding dynamic allocation wherever possible, asserting that indices are always in bounds, etc.

      unfortunately, the folks supplying the code do not use those techniques and i don't have the luxury of re-educating them.

      as for why i'm using c++: implementing about 95% of this particular application in any language that isn't compiled to machine code with a darn good
      • FORTRAN? Ada? Pascal?

        Their GCC optimization probably isn't that hot, though.

        One of the really nice things about VMS is the Common Language Environment. This makes it possible to create executables out of object files written in multiple languages: C, Bliss, Macro/Asm, BASIC, COBOL, FORTRAN, Ada, etc. All of the language parsers create an intermediary language which the GEM common backend code generator takes and uses to build .obj files.

        Yes, .NET does that, and so does GCC 4, to a degree, but VMS has ha
        • all good languages. pity i don't speak them. intel's compilers are pretty darn good at optimization (you pretty much get about 10-15% speed up v. gcc with a recompile, more if you're willing to throw more disk space and time at the problem and more still if you link in their math and image libraries). it's not free, but it's cheap enough that it doesn't have to save much time before it's paid for itself.
  • You may find DieHard useful, if heap errors are also a problem -- it's a plug-in replacement for malloc/free that provides probabilistic guarantees of correct execution in the face of errors. []

    -- emery
  • gcc -fbounds-check could find some memory problems (and some non-problems). I used this feature over 6 years ago before it was integrated into gcc. YMMV.

A committee takes root and grows, it flowers, wilts and dies, scattering the seed from which other committees will bloom. -- Parkinson