Testers are often caught into the trap of analyzing just the GUI interface and not looking beyond it. It is helpful many a times to go beyond the black box view and explore a little further. This is typically a grey area between black box and white box, wherein a tester tries to understand the application a little further without access to the source code.
One such area is testing a software by modifying/corrupting the contents of associated files. This is ideally called “File Fuzzing”. In this post, I’ll pick up an interesting case of testing by corrupting a blank file.
Yes! You read it correctly! – Corrupting the contents of a blank file! You might think how can a blank file have contents? Let’s jump directly into a practical example.
Microsoft Excel supports publishing “.xls” files (there are others too, but let’s stick to this and let’s not think about how many different versions of xls it supports). So, an xls file is an associated file for MS Excel software. Let’s see what a “blank” xls file might look like.
To create a blank xls file or empty xls file:
- Right click in the Windows Explorer and click “New->Microsoft Excel Worksheet” OR
- Open the MS Excel interface from Programs and click File->Save As.. and give a name of choice.)
I find the first one quicker. So, here goes the snapshot of how it looks in windows Explorer:
![]()
A blank file? Still 12 KB!! More on this later, first let’s see this file in MS Excel:

So, it’s essentially a “blank” file as far as its purpose as a spreadsheet is concerned. There’s no data.
But did you get a hint? It’s still a 12 KB file. There’s definitely some data in it. Let’s open it in a hex editor and have a look:

So, you can see hundreds of rows of data. This data is used by MS Excel to render the file. When you edit the data, this is added in the proprietary excel format.
Coming to testing front, this sequence of bytes may not make any sense to you till you start looking at the Excel format. These values will typicially be a sequence of records wherein each record contains a sequence of bytes telling its type, length and the data contained in it. Software reads the file parses it fully to render it (or parses, renders, parses, renders and so on…).
As you can think of, there can be infinite number of test cases for these values e.g. around boundary values of each record length allowed by software and the one constrained by the number of bytes/bits.
I took Excel as just an example, you can try this any multimedia file, office productivity suite file, archive file and you will find similar observations – the key is to locate such files for your software.
So, next time someone asks how many test cases can you conduct by utilizing an “empty” file, you know what the answer should be!
Rahul Verma
Site Admin, www.testingperspective.com
Leave a reply to richa Cancel reply