Get This Blog via Email:

Powered by Squeet.com

My Links

Home
Readify
Google
Contact
Syndication

Associations

Blog Stats

Posts - 1112
Stories - 0
Comments - 1281
Trackbacks - 477

Post Categories

Image Galleries

.NET Community

.NET Rocks!
Adelaide Dot Net Users Group
ASP.NET
AspAdvice
ASPalliance
ASPInsiders
Canberra .NET User Group
Gold Coast .NET SIG
GotDotNet
International .NET Association
Melbourne .NET User Group
MSDN Update (Australia)
Newcastle .NET User Group
Orkut
Queensland MSDN User's Group
Sydney .NET Users Group
Sydney Deep .NET User Group
The Australian Developers.NETwork
Wollongong .NET Users Group

.NET Tools

NAnt
NAntContrib
NHashcash
NProf
NUnit

Developer Resources

Distractions

Addicting Games
CollegeMix Games
FlashGames247
Flying Pig Game
Guess-the-Google
rathergood.com
Smashing Games
Squares 2
The Meatrix
Yeti Sports

Morning Ritual

ACTION Buses
Weather Radar

My Projects

NHashcash
Shrinklet

Readify Bloggers

Bill Chesnut
Chris Hewitt
Dan Green
Darren Neimke
Grant Holliday
Greg Low
Joseph Cooney
Kim Peacocke
Luke Drumm
Martin Granell
Scott Baldwin

Login

Username:

Password:

Remember Me

Windows Forms 2.0 Source Code

Shawn Burke from the Windows Forms team posted up a few days ago about the possibility of Microsoft deciding to ship the Windows Forms 2.0 source code (thanks for the link Joseph). I’m really excited by the possibility of being able to look at the code both to support debugging but also from an educational point of view. I’m most excited by getting the opportunity to look at some of the designer code that is baked into the framework. I wonder if the source code released would include snippets from System.Drawing.dll, System.Drawing.Design.dll and System.Design.dll as well.

But (there is always a but), Shawn needs to get the idea past LCA and that means a discussion not only about the code that ends being compiled into the framework but the comments which exist in the source code.

When developing software we sometimes get frustrated at ourselves, and others and this might lead us to leave a few choice comments in source code. If this code is indexable via Google or freely downloadable you might want to clean the code up a little bit.

In general there are three approaches for cleaning up comments:

Strip all the comments automatically.
Strip inappropriate comments manually.
Strip inappropriate comments automatically with a quick manual check.

Stripping out all non-executable comments is fairly trivial, especially when comments consume the entire line inside the source file – if they don’t you need to do some fairly basic source parsing to figure out if a section of text is a comment or not.

The manual approach really isn’t going to work. It would take forever to scrub the 500,000 lines of code especially when to extend the search beyond basic profanity to off jokes and customer references. You also need to consider the naming of variables, private fields and private methods which may not be appropriate.

The way I would do it is come up with a quick tool (“codecop” anybody) which reads in each source file in its entirety and did a rule-based search across code and comments for anything off colour.

A dumb version of codecop could do only two things, warn by inclusion, or warn by exclusion, meaning that it would either look for things that it did have in some kind of list or look for things that it didn’t have in its list. If it was possible to get a parse tree of the source file a more intelligent search could be done.

In the case of non-executable comments the search tool could automatically remove them if they aren’t appropriate. Instead of just deleting them it would need to replace them with whitespace (the whole region of comments, not just the offending word), this would allow the code sweep to be done after the compile and still keep the PDB source file references inline so executed code could be highlighted in the debugger.

Executable code that couldn’t be cleaned automatically could be reported back (the notification system could be configured to log directly into the bug tracking database at Microsoft) or to some unfortunate contractor who is given the job to vet the code manually. Given that this could be an ongoing process you’d probably want to patch the code up so you didn’t have to keep correcting the same problems.

Its a great idea that presents some interesting problems – and I doubt that Microsoft is the first to try and tackle them.

posted on Saturday, February 05, 2005 8:19 PM

Comments

# re: Windows Forms 2.0 Source Code

Alex Lowe

This isn't new to Microsoft either. I mean, Microsoft has released code to the public, partners, ISVs, etc. so the issue of scrubbing the comments in code is definitely now new. I'm not in the Developer Division and I've never looked for any automated solutions but I'll bet one already exists inside Microsoft given the amount of code that has been scrubbed over time.

Posted @ 2/5/2005 10:00 PM

# So true...

CogitativeMind

Posted @ 2/7/2005 8:06 AM

# Winforms source-code comments redux

JCooney.NET

Posted @ 2/7/2005 10:34 AM