Home | About | RSS Feed | Contact and Publicity Guidelines | Comment Policy the Law, the Universe, and Everything 

advertise-here4


Slip Opinions


Groundhog Day. (fp)

Banned in Tucson. (kw)

The Best and Worst of 2011 in Race and Law (kw)

Tortured to death for trespassing. (fp)

Drones of contention. (fp)

DOJ still coddling banks. (fp)

Creative destruction? Thank banks. (fp)

Blog about a new book, on how to talk to little girls--stressing smarts not cutes.   LAC

Macey on the heroic Rakoff. (fp)

Captured NY Fed. (fp)


solicitors

Our Podcast

Subscribe to Law Talk

law-rev-contents2.jpg


  • Posts by Author

  • Categories

  • Archives


  • Recent Comments


    • feathered_head on Physical Punishment and Parental Rights

    • Concernicus on Physical Punishment and Parental Rights

    • Ian on Physical Punishment and Parental Rights

    • Peterk on Physical Punishment and Parental Rights

    • Robert on Physical Punishment and Parental Rights

    • Three Oranges on Physical Punishment and Parental Rights

    • Paul Robichaux on Physical Punishment and Parental Rights

    • JR on Physical Punishment and Parental Rights

    • Jan on Physical Punishment and Parental Rights

    • Mark on Physical Punishment and Parental Rights

    • Shag from Brookline on Omelets and Eggs

    • Shag from Brookline on Omelets and Eggs

    • Joe on What Exactly is Wrong With Polygamy?

    • Phil on What Exactly is Wrong With Polygamy?

    • Lee on Lifecycles and the Firm
  •  

    Site Meter

    About the Blog

    Concurring Opinions is a multiple authored, general interest legal blog.

    (Image: Wikicommons)

Replicability, Exam Grading, and Fairness

posted by Dave Hoffman

Exam-Grade-2a.jpgWhat does it mean to grade fairly?

At my law school, and presumably elsewhere, law students aggrieved by a grade can petition that it be changed. Such petitions are often granted in the case of mathematical error, but usually denied if the basis is that on re-reading, the professor would have reached a different result. The standard of review for such petitions is something like “fundamental fairness.” In essence, replicability is not an integral component of fundamental fairness for these purposes.

Law students may object to this standard, and its predictable outcome, asserting that if the grader can not replicate his or her outcomes when following the same procedure, then the total curve distribution is arbitrary. On this theory, a student at the least should have the right to a new reading of their test, standing alone and without the time-pressure that full-scale grading puts on professors.

To which the response is: grading is subjective, and not subject to scientific proof. Moreover, grades don’t exist as platonic ideals but rather distributions between students: only when reading many exams side by side can such a ranking be observed. We wouldn’t even expect that one set of rankings would be very much like another: each is sort of like a random draw of a professor’s gut-reactions to the test on that day.

This common series of arguments tends to engender cynicism among high-GPA and low-GPA students alike. To the extent that law school grading is underdetermined by work, smarts and skill, it is a bit of a joke. The importance placed on these noisy signals by employers demonstrates something fundamentally bitter about law – the power of deference over reason.


This is an old debate, made more salient today by my recent experiences at the AELSC One the major messages of the ELS proponents was replicability: put your data online; record your methods (even your syntax); and make sure that further researchers can recreate your results. This raised for me the question of what is an empirical legal studies approach to grading?

My grading method uses an Excel matrix. Although I sometimes say that “no one can be told what the Matrix is,” in practice this is probably untrue. Basically, I break up each exam question into its constituent issues, award points for spotting and discussing that issue, and move on. I do not have much room in my spreadsheet for novel attacks on problems (which is a problem absent notice), but I do try to correct for outlier issues. Each issue spotting test has something like 50-70 issues to catch.

There are major advantages to this method, primarily, that it allows students to see exactly how and why they lost points during exam review. However, I doubt that the scores awarded are particularly replicable (although I hope that my overall ranking of students would be fairly robust.) That is to say, my grading of law school exams essays is more art that science.

Is this bad? ELS might suggest reasons to think so, as would a fairly traditional behind-the-veil analysis. So, perhaps professors have the duty to make sure (through data diagnosis) that their grading is replicable. One way to do so would be to do blind re-grades of randomly selected exams; another would be to have two sets of graders; another would be to change to a multiple-choice framework. These are all fairly drastic or labor intensive solutions. I might support them, but I’m not yet fully convinced of the underlying claim. What do others think? Does good grading practice mean replicability? Transparency with respect to the data? Auditing? Or are those quantitative principles inapt when considering essay tests? (Another way to think about this is that the mistake is mine when I reduce each issue to a number of points. Better to consider the test as a whole?)


 February 12, 2007 at 12:43 pm   Posted in: Empirical Analysis of Law, Law School (Teaching)   Print This Post Print This Post

Responses (3)

  1. Jeff Lipshaw - February 12, 2007 at 3:02 pm

    Your method sounds a lot like mine. I don’t think it’s a good idea to look at the test as a whole. To the point about relative scores versus objective scores, I do one question at a time without looking at the student’s scores on preceding questions, and try, if possible, to do all of a question at one sitting. And I reverse the stack of exams on each question. It gives the student a fair shake on each question without preconceived notions. And I find that it still shakes out into a curve – by and large, the people who get As have done well consistently across all the questions.

  2. KipEsquire - February 12, 2007 at 4:30 pm

    In the spirit of “interdisciplinary studies,” you might find this post on “grades and Wall Street” interesting. ;-)

  3. Belle Lettre - February 12, 2007 at 7:10 pm

    Speaking as both a graduate law student and an aspiring prof who has graded hundreds of exams as a TA, I say cynically that a bona fide attempt at fairness is requisite–but it’s the apearance of fairness that for some reason matters.

    Jeff’s grading system is standard, most professors in other disciplines (I speak of political science, sociology, English lit) request that you do so. And Dave, your grading system “appears” fair; that is you have created a set of metrics by which each student will be judged. Question is, do metrics ensure objectivity? The complaint about the law, social sciences and humanities from science-trained students is that we’re so subjective in our grading. Do we get around subjectivity by creating “objective” indicia? Or is everything too subjective–the framing of the question, the ranking of of the issues to spot–to get around?

    I kind of like the subjectivity of the law, there are some obvious right things to spot but it’s in the “massaging of the facts” and the application of the law to them as my Civ Pro prof said–that we find the answers. My co-blogger Jeff Harrison at Money-Law happens to think multiple choice questions are easy outs for faculty, and as a student I can say I hate them. They just bug me.

    I’m a bit disturbed by this retro kick to objectivise the law and try to create a metric for everything–aren’t we supposed to be old school legal realists? (Leiter has declared ELS to not be the New Legal Realism, it’s law + stats). But I do understand the end goal of fairness and measurability, I’m just wary of the methods. It’s all about methods in ELS.

    But for more pointed grading strategies, in every class I’ve graded (six so far) we made a key (like you, Dave) and we took a few models answers of A, B, C, D range papers. If 2-3 are of consistent quality per range, they’re good models as you go along for the rest. And I’ve found that when I type up comments students seem to think that lends validity and credibilty to my assessment.

Leave a Reply

Spam protection by WP Captcha-Free


  • « Previous post
  • Next post »

Authors

Daniel J. Solove
Kaimipono Wenger
Dave Hoffman
Frank Pasquale
Deven Desai
Danielle Citron
Lawrence Cunningham
Sarah Waldeck
Jaya Ramji-Nogales
Solangel Maldonado
Gerard Magliocca

Guests

Derek Bambauer
Gabriella Coleman
andré douglas pond cummings
David Gray
Brishen Rogers
Joseph Turow
Elizabeth A. Wilson













Previous Guests

Michael Abramowicz
Michelle Adams
Robert Ahdieh
Marvin Ammori
Michelle Anderson
Laura Appleman
Taunya Lovell Banks
Ann Bartow
Steven Bellovin
Adam Benforado
Gaia Bernstein
Francesca Bignami
Josh Blackman
Joseph Blocher
Jeremy Blumenthal
Kathleen Boozang
Bruce Boyden
Donald Braman
Al Brophy
Neil H. Buchanan
Bill Burke-White
Scott Burris
Paul Butler
Ryan Calo
Naomi Cahn
Anupam Chander
Miriam Cherry
Jack Chin
Glenn Cohen
Jennifer Collins
Caroline Mala Corbin
Thomas Crocker
Allison Danner
Brannon Denning
Deven Desai
Mike Dimino
Mark Edwards
Maxine Eichner
Jessica Erickson
David Fagundes
Lisa Fairfax
Joshua Fairfield
Christine Haight Farley
Kim Ferzan
Dan Filler
Mary Anne Franks
Michael Froomkin
Amanda Frost
Brian Frye
Timothy Glynn
Rachel Godsil
Eric Goldman
Kyle Graham
David Gray
Craig Green
Tristin Green
Jonathan Hafetz
Meredith Harbach
Michelle Harner
Jeffrey Harrison
Hosea Harvey
Erica Hashimoto
Jennifer Hendricks
Carissa Hessick
Laura Heymann
Robert Hillman
Gilbert A. Holmes
Nicole Huberfeld
Christine Hurt
Darian Ibrahim
Sherrilyn Ifill
John Ip
Shavar Jeffries
Kevin Johnson
Kristin Johnson
Jeff Jonas
Courtney Joslin
Dan Kahan
Jeffrey Kahn
Brian Kalt
Sam Kamin
Michael Kang
Chimène Keitner
Alicia Kelly
Orin Kerr
Nancy Kim
Heidi Kitrosser
Adam Kolber
Russell Korobkin
Alex Kreit
Anita S. Krishnakumar
Susan Kuo
Greg Lastowka
Sarah Lawsky
Youngjae Lee
Margaret Lewis
Erik Lillquist
Jeff Lipshaw
Jonathan Lipson
Jacqueline Lipton
Matthew Lister
Joseph Liu
Michael Madison
Kevin Noble Maillard
Solangel Maldonado
Jason Mazzone
Linda McClain
William McGeveran
Salil Mehra
Carrie Menkel-Meadow
Max Minzner
Viva Moffat
Scott Moss
Eric Muller
Jaya Ramji-Nogales
Helen Norton
Elizabeth Nowicki
Paul Ohm
Angela Onwuachi-Willing
Michael O'Shea
David Opderback
Kristen Osenga
Rafael Pardo
Marcy Peek
Eduardo Peñalver
Robert Percival
Michael J. Pitts
Marc Poirier
David Post
Amanda Pustilnik
Shruti Rana
Geoffrey Rapp
Neil Richards
Lori Ringhand
Alice Ristroph
Marc Roark
Sasha Romanosky
Tuan Samahon
Susan Scafidi
David Schraub
Paul Secunda
Jonathan Siegel
Jessica Silbey
Peter Smith
Judd Sneirson
Adam Steinman
Charles Sullivan
Rick Swedloff
Olivier Sylvain
Steph Tai
Andrew Taslitz
Robert Tsai
Jenia Turner
Steve Vladeck
Ari Waldman
Spencer Weber Waller
Howard Wasserman
Melissa Waters
Frank Wu
Alfred Yen
Corey Yung
David Zaring
Timothy Zick
Michael Zimmer
Jonathan Zittrain

Ownership

Concurring Opinions is a
general-interest legal blog
operated by Concurring
Opinions LLC, a Pennsylvania
Limited Liability Corporation.

Blogroll

Above the Law
Access to Justice
ACS Blog
Althouse
Balkinization
Becker-Posner Blog
BlackProf
BoingBoing
Chicago Law Faculty Blog
Conglomerate
CrimLaw
Crime & Federalism
CrimProf Blog
Crooked Timber
Derechoalderecho
Discourse.net
Dorf on Law
Election Law
Emergent Chaos
The Faculty Lounge
Feminist Law Profs
43(B)log
Freakonomics Blog
Freedom to Tinker
Google Blogoscoped
How Appealing
Ideoblog
Info/Law
Instapundit.com
Juris Novus
Jurisdynamics
Just Books
Law and Humanities Blog
Law and Letters
Law Librarian Blog
Legal Profession Blog
Legal Theory Blog
Legal Times Blog
Leiter Reports
Brian Leiter's Law School Reports
Lessig Blog
Madisonian Theory
Media Law Blog
Mirror of Justice
The Moderate Voice
National Security Advisors
Opinio Juris
Point of Law
PrawfsBlawg
ProfessorBainbridge.com
Property Prof Blog
Red Tape Chronicles
The Right Coast
Schneier on Security
SCOTUSBlog
Security Dilemmas
Sentencing Law and Policy
Simple Justice
Sivacracy.net
The Situationist
Susan Crawford
TalkLeft
Talking Points Memo
TaxProf Blog
TeachPrivacy Blog
Tech & Marketing Law
Truth on the Market
Volokh Conspiracy
WorkPlace Prof Blog
WSJ Law Blog
Wonkette
The Yin Blog


© Concurring Opinions

Powered by WordPress