Home | About | RSS Feed | Contact and Publicity Guidelines | Comment Policy the Law, the Universe, and Everything 

Search


Concurring Opinions is a
general-interest legal blog
operated by Concurring
Opinions LLC, a Pennsylvania
Limited Liability Corporation.

jr_114_9780195367195_bnr

jr_114_9780195383768_bnr

advertise-here4


FC-CO(SS)

Our Podcast

Subscribe to Law Talk

law-rev-contents2.jpg


  • Posts by Author

  • Categories

  • Archives


  • Recent Comments

    • Mike Zimmer on From the other side at AALS . . .

    • Mike Zimmer on The Employer’s Strategy in Gross v. FBL Financials

    • Mike Zimmer on Drafting the 28th Amendment

    • M.G.M on Drafting the 28th Amendment

    • A.J. Sutter on Lawyers: Don’t Trade on Inside Information!

    • No Load Funds on Consumer Financial Product Safety?

    • grad student on Princeton and the Behavioral Revolution

    • Anon321 on The Passive Voice in Statutory Interpretation

    • Steven Kaminshine on The Employer’s Strategy in Gross v. FBL Financials

    • Alex Kreit on Politicians: Have you talked to your constituents about drug policy?

    • Alex Kreit on Election Night 2009

    • mikeb302000 on Election Night 2009

    • Neal Goldfarb on The Passive Voice in Statutory Interpretation

    • Orin Kerr on Politicians: Have you talked to your constituents about drug policy?

    • MYarnell on Curricular Reform Revisited

  •  

    Site Meter

Replicability, Exam Grading, and Fairness

posted by Dave Hoffman

Exam-Grade-2a.jpgWhat does it mean to grade fairly?

At my law school, and presumably elsewhere, law students aggrieved by a grade can petition that it be changed. Such petitions are often granted in the case of mathematical error, but usually denied if the basis is that on re-reading, the professor would have reached a different result. The standard of review for such petitions is something like “fundamental fairness.” In essence, replicability is not an integral component of fundamental fairness for these purposes.

Law students may object to this standard, and its predictable outcome, asserting that if the grader can not replicate his or her outcomes when following the same procedure, then the total curve distribution is arbitrary. On this theory, a student at the least should have the right to a new reading of their test, standing alone and without the time-pressure that full-scale grading puts on professors.

To which the response is: grading is subjective, and not subject to scientific proof. Moreover, grades don’t exist as platonic ideals but rather distributions between students: only when reading many exams side by side can such a ranking be observed. We wouldn’t even expect that one set of rankings would be very much like another: each is sort of like a random draw of a professor’s gut-reactions to the test on that day.

This common series of arguments tends to engender cynicism among high-GPA and low-GPA students alike. To the extent that law school grading is underdetermined by work, smarts and skill, it is a bit of a joke. The importance placed on these noisy signals by employers demonstrates something fundamentally bitter about law – the power of deference over reason.


This is an old debate, made more salient today by my recent experiences at the AELSC One the major messages of the ELS proponents was replicability: put your data online; record your methods (even your syntax); and make sure that further researchers can recreate your results. This raised for me the question of what is an empirical legal studies approach to grading?

My grading method uses an Excel matrix. Although I sometimes say that “no one can be told what the Matrix is,” in practice this is probably untrue. Basically, I break up each exam question into its constituent issues, award points for spotting and discussing that issue, and move on. I do not have much room in my spreadsheet for novel attacks on problems (which is a problem absent notice), but I do try to correct for outlier issues. Each issue spotting test has something like 50-70 issues to catch.

There are major advantages to this method, primarily, that it allows students to see exactly how and why they lost points during exam review. However, I doubt that the scores awarded are particularly replicable (although I hope that my overall ranking of students would be fairly robust.) That is to say, my grading of law school exams essays is more art that science.

Is this bad? ELS might suggest reasons to think so, as would a fairly traditional behind-the-veil analysis. So, perhaps professors have the duty to make sure (through data diagnosis) that their grading is replicable. One way to do so would be to do blind re-grades of randomly selected exams; another would be to have two sets of graders; another would be to change to a multiple-choice framework. These are all fairly drastic or labor intensive solutions. I might support them, but I’m not yet fully convinced of the underlying claim. What do others think? Does good grading practice mean replicability? Transparency with respect to the data? Auditing? Or are those quantitative principles inapt when considering essay tests? (Another way to think about this is that the mistake is mine when I reduce each issue to a number of points. Better to consider the test as a whole?)


 February 12, 2007 at 12:43 pm   Posted in: Empirical Analysis of Law, Law School (Teaching)   Print This Post Print This Post

Responses (3)

  1. Jeff Lipshaw - February 12, 2007 at 3:02 pm

    Your method sounds a lot like mine. I don’t think it’s a good idea to look at the test as a whole. To the point about relative scores versus objective scores, I do one question at a time without looking at the student’s scores on preceding questions, and try, if possible, to do all of a question at one sitting. And I reverse the stack of exams on each question. It gives the student a fair shake on each question without preconceived notions. And I find that it still shakes out into a curve – by and large, the people who get As have done well consistently across all the questions.

  2. KipEsquire - February 12, 2007 at 4:30 pm

    In the spirit of “interdisciplinary studies,” you might find this post on “grades and Wall Street” interesting. ;-)

  3. Belle Lettre - February 12, 2007 at 7:10 pm

    Speaking as both a graduate law student and an aspiring prof who has graded hundreds of exams as a TA, I say cynically that a bona fide attempt at fairness is requisite–but it’s the apearance of fairness that for some reason matters.

    Jeff’s grading system is standard, most professors in other disciplines (I speak of political science, sociology, English lit) request that you do so. And Dave, your grading system “appears” fair; that is you have created a set of metrics by which each student will be judged. Question is, do metrics ensure objectivity? The complaint about the law, social sciences and humanities from science-trained students is that we’re so subjective in our grading. Do we get around subjectivity by creating “objective” indicia? Or is everything too subjective–the framing of the question, the ranking of of the issues to spot–to get around?

    I kind of like the subjectivity of the law, there are some obvious right things to spot but it’s in the “massaging of the facts” and the application of the law to them as my Civ Pro prof said–that we find the answers. My co-blogger Jeff Harrison at Money-Law happens to think multiple choice questions are easy outs for faculty, and as a student I can say I hate them. They just bug me.

    I’m a bit disturbed by this retro kick to objectivise the law and try to create a metric for everything–aren’t we supposed to be old school legal realists? (Leiter has declared ELS to not be the New Legal Realism, it’s law + stats). But I do understand the end goal of fairness and measurability, I’m just wary of the methods. It’s all about methods in ELS.

    But for more pointed grading strategies, in every class I’ve graded (six so far) we made a key (like you, Dave) and we took a few models answers of A, B, C, D range papers. If 2-3 are of consistent quality per range, they’re good models as you go along for the rest. And I’ve found that when I type up comments students seem to think that lends validity and credibilty to my assessment.

Leave a Reply

*
To prove you're a person (not a spam script), type the security word shown in the picture. Click on the picture to hear an audio file of the word.
Click to hear an audio file of the anti-spam word


  • « Previous post
  • Next post »

Authors

Daniel J. Solove

Website
Understanding Privacy

Kaimipono Wenger

Website
SSRN Page

Dave Hoffman

Website
SSRN Page

Nate Oman

Website
SSRN Page

Frank Pasquale

Website
SSRN Page

Deven Desai

Website
SSRN Page

Danielle Citron

Website
SSRN Page

Lawrence Cunningham

Website
SSRN Page

Sarah Waldeck

Website
SSRN Page

Jaya Ramji-Nogales

Website
SSRN Page

Solangel Maldonado

Website
SSRN Page

Gerard Magliocca

Website
SSRN Page


Guests

Rachel Godsil
Alex Kreit
Anita Krishnakumar
Matthew Sag
Michael Zimmer






Previous Guests

Michael Abramowicz
Michelle Adams
Robert Ahdieh
Michelle Anderson
Laura Appleman
Ann Bartow
Francesca Bignami
Jeremy Blumenthal
Kathleen Boozang
Bruce Boyden
Donald Braman
Al Brophy
Neil H. Buchanan
Bill Burke-White
Scott Burris
Paul Butler
Naomi Cahn
Anupam Chander
Miriam Cherry
Jack Chin
Jennifer Collins
Allison Danner
Brannon Denning
Deven Desai
Mike Dimino
Mark Edwards
David Fagundes
Christine Haight Farley
Kim Ferzan
Dan Filler
Michael Froomkin
Amanda Frost
Timothy Glynn
Rachel Godsil
Eric Goldman
David Gray
Craig Green
Tristin Green
Jeffrey Harrison
Erica Hashimoto
Carissa Hessick
Laura Heymann
Robert Hillman
Christine Hurt
Darian Ibrahim
John Ip
Kevin Johnson
Dan Kahan
Brian Kalt
Sam Kamin
Michael Kang
Chimène Keitner
Orin Kerr
Nancy Kim
Heidi Kitrosser
Adam Kolber
Russell Korobkin
Anita S. Krishnakumar
Susan Kuo
Greg Lastowka
Sarah Lawsky
Erik Lillquist
Jeff Lipshaw
Jonathan Lipson
Jacqueline Lipton
Joseph Liu
Michael Madison
Solangel Maldonado
Jason Mazzone
Linda McClain
William McGeveran
Salil Mehra
Carrie Menkel-Meadow
Max Minzner
Scott Moss
Eric Muller
Jaya Ramji-Nogales
Helen Norton
Elizabeth Nowicki
Paul Ohm
Michael O'Shea
David Opderback
Kristen Osenga
Rafael Pardo
Marcy Peek
Eduardo Peñalver
Robert Percival
David Post
Shruti Rana
Geoffrey Rapp
Neil Richards
Lori Ringhand
Alice Ristroph
Susan Scafidi
Paul Secunda
Jonathan Siegel
Jessica Silbey
Peter Smith
Charles Sullivan
Rick Swedloff
Steph Tai
Andrew Taslitz
Robert Tsai
Jenia Turner
Steve Vladeck
Sarah Waldeck
Melissa Waters
Alfred Yen
David Zaring
Timothy Zick
Spencer Weber Waller
Howard Wasserman
Frank Wu
Corey Yung
Jonathan Zittrain

Blogroll

Above the Law
ACS Blog
Althouse
Balkinization
Becker-Posner Blog
BlackProf
BoingBoing
Chicago Law Faculty Blog
Conglomerate
CrimLaw
Crime & Federalism
CrimProf Blog
Crooked Timber
Discourse.net
Dorf on Law
Election Law
Emergent Chaos
The Faculty Lounge
Feminist Law Profs
43(B)log
Freakonomics Blog
Freedom to Tinker
Google Blogoscoped
How Appealing
Ideoblog
Info/Law
Instapundit.com
Juris Novus
Jurisdynamics
Law and Humanities Blog
Law and Letters
Law Librarian Blog
Legal Profession Blog
Legal Theory Blog
Legal Times Blog
Leiter Reports
Brian Leiter's Law School Reports
Lessig Blog
Madisonian Theory
Media Law Blog
Mirror of Justice
The Moderate Voice
National Security Advisors
Opinio Juris
Point of Law
PrawfsBlawg
ProfessorBainbridge.com
Property Prof Blog
Red Tape Chronicles
The Right Coast
Schneier on Security
SCOTUSBlog
Security Dilemmas
Sentencing Law and Policy
Simple Justice
Sivacracy.net
The Situationist
Susan Crawford
TalkLeft
Talking Points Memo
TaxProf Blog
Tech & Marketing Law
Truth on the Market
Volokh Conspiracy
WorkPlace Prof Blog
WSJ Law Blog
Wonkette
The Yin Blog


© Concurring Opinions

Powered by WordPress