Monday, March 3, 2008

Anatomy of a Ph.D. thesis

Let's face it: life is complicated. But thanks to the ever-flourishing DIY industry (for example, WikiHow), a lot of endeavors that used to seem complicated are made much less so through step by step instructions. In science, experimental protocols already do this, at least in theory, but what about other aspects of science, like writing papers, keeping up with literature, making presentations, or networking at conferences? My advisor has given informal talks for his students on a number of these topics, the latest of which was a set of general guidelines for writing a Ph.D. thesis.

In order of their appearance in the final document...
  • Chapter 1 - Introduction. This is essentially an executive summary. You should briefly describe all contributions your thesis makes to your field, provide at least one "gee-whiz" result, and lay out a roadmap for the rest of the thesis ("In Chapter 2, I present the background... In Chapter 3, I discuss my work on X...."). It is acceptable to make claims without proof, since you will be defending these later on.
  • Chapter 2 - Background. This is essentially a literature review, and demonstrates your understanding of the field and the context surrounding your work. For bioinformatics theses, this covers both the biomedical domain and the area of informatics or computation your work involves. You should present an intellectual framework in which your work fits - what has been done, the advantages and limitations of this previous work, the potential avenues for improvement, and where you come in. Ideally, this chapter could be published as a review article with very little modification.
The next couple chapters are the meat of the thesis, and can take at least two forms depending on what kind of work you did during your Ph.D. If you worked on several somewhat disjoint projects and published 2 or 3 papers on them, you can write one chapter for each paper (but no more than 3). If you worked on just one problem, you are probably better off writing a chapter for the methods and a chapter for the results and discussion (if you developed two approaches for the same problem, you can repeat this for the second approach). So:
  • GENERAL THEME: Several projects
    Chapter 3 - Methods, Results, and Discussion from paper 1
    Chapter 4 - Methods, Results, and Discussion from paper 2
    (Chapter 5 - Methods, Results, and Discussion from paper 3, if applicable)
    FOCUSED THEME: Single project

    Chapter 3 - Methods
    Chapter 4 - Results/Discussion
    (Chapters 5 and 6 - Methods and Results/Discussion for approach 2, if applicable)

    In general, you do not want to reuse text from your published papers verbatim, despite how tempting this can be. Papers are very strict and limit what you can express, so you should see your thesis as an opportunity to pontificate and give voice to your ideas. You should also form your thesis into a detailed guide of everything you tried, even some of the things that didn't work, so that it can be a reference to future generations of grad students who may pursue extensions of your research.
  • Chapter 6 or 7, depending on type of thesis - Summary chapter. Describe overall contributions to the relevant domains. (For biomedical informatics theses, describe the overall contributions to biology or medicine, and the overall contributions to informatics or engineering. If applicable, you may also describe core contributions to computer science.) Here is where you also discuss the limitations of the work, the unsolved problems, and your best ideas for how to solve them.

  • Appendices - supplementary material. Almost anything goes, but you should definitely include all key data and datasets (information needed to recreate the major results from your thesis). Ideally, all data relevant to your thesis (and other related work, if possible) will be stored and/or made available either on the web or as a physical copy, though this is mostly for the advisor as a reference for future students. If you have any proofs or supplementary material, these should be in an appendix. You can also include additional work or papers published unrelated to your thesis.
So that explained what each chapter of the thesis should be about; what about actually writing the thesis? My advisor's recommendation is to start with the meat chapters (Ch. 3 - 6/7) since you should have pretty much all the necessary material to begin with, then write Chapter 2, then write the first and last chapters.

More specific advice on how to actually write each chapter was not covered and probably warrants its own post. Note that this is my advisor's take on the Ph.D. thesis; I'm sure there are some other interpretations, which would be interesting to hear! How much does the thesis vary by field?

5 comments:

Jean-Claude Bradley said...

Nice guide! Indeed it is best to start with the results and ideas that are most solid and build from there.

Bill Hooker said...

I think it varies pretty widely by field. My favourite model is the one where you bind your published papers together, slap 'em between an introduction/lit review and a final discussion, and then get on with your life.

Jean-Claude Bradley said...

Bill that is a nice option if you can do it in your field. In organic chemistry at least, the thesis is an opportunity to write up all the "failed experiments" and what you learned from them - of course those don't typically appear in published papers.

Pedro Beltrao said...

What is amazing is the variability also in what is required during the thesis defense from country to country and among the different disciplines.

Anonymous said...

Thanks for sharing this framework, especially the details. Very timely for some of us!

An approach I've heard is to have a chapter/paper on theory, another on application, and a third on evaluation. Applicable to some projects but not all.