
Scientific, by Brittany G
I’m currently reading The Black Swan: The Impact of the Highly Improbable by Nassim Nicholas Taleb. It’s a refreshing and humbling account of our inability to understand randomness. Two of the arguments in the book stuck me as being very relevant to assessment.
The first of these is an urgent reminder of what Karl Popper taught us about the scientific method: A million confirmations of a thesis will not yield us much information, as we might have just missed the million-and-first circumstance in which the thesis does not hold. However a single falsification of the thesis will establish that is is untrue. And so we can only really learn through falsification, not justification.
A summative assessment is really a test of the hypothesis that our learners are competent. With this in mind, good scholarly practice dictates that we test this hypothesis by finding possible weaknesses in it, and attempting to expose them. So maybe we should not be looking for learners to demonstrate their competence, but we should be attempting to let them reveal any incompetence they might have.
With this in mind, there is perhaps reason to question certain assessment practices such as negotiated assessments, essays and portfolios. These all allow the learner to demonstrate competence largely on their own terms, without allowing us to test for incompetence on ours. It is the learner who takes the lead here in what to put on display for our prying eyes, which also provides the opportunity to perhaps not display the more questionable areas of their attainment. While from a motivational standpoint this has many benefits, it does seem to contradict the basic principles of the scientific method.
There are other approaches that seem better aligned with this principle. Exams are an obvious choice. But I also think a well designed problem based assessment can work in accordance with scientific principles, as long as the assignment is designed with the right pitfalls and challenges that could trigger falsification. A viva voce also provides an excellent opportunity for falsification, provided that the examiners question the candidate appropriately.
Is this something that should inform our practice? Is an assessment regime without a significant component focussing on falsification actually valid, or are we subjecting ourself to a fallacy?
The second topic links to how our behaviour is very domain specific, and what this might imply for assessment practice. I will save that for another post.