You are viewing the EdNews Blog archives.
These archives contain blog posts from before June 7, 2011
Click here to view the new Voices section of EdNews

Archive for the ‘Teacher evaluation’ Category

Fix SB 191 before it dies

Tuesday, November 30th, 2010

Robert Reichardt is director of the Center for Education Pollcy Analysis, School of Public Affairs, at the University of Colorado at Denver.

Senator Mike Johnston’s Bill on educator effectiveness (SB 191) is slowly dying, weighed down by its technical impossibilities.  The absurdity of rating teachers on student growth when no measures of growth exist for two-thirds of teachers is an insurmountable roadblock.

This technical impossibility has gummed up the work of the Council for Educator Effectiveness. The lost time and effort will be greatly magnified if we push the implementation of the current law down to districts and schools. Imagine what you would do as a principal if you were told you had to rate teachers using a statistical measure with little or no relation to actual “true” value. A good principal would delay, distract, and obfuscate as she worked to keep teachers focused on teaching.

This is not to say SB 191 does not have valuable components.  Some parts should be implemented without modification or delay:

  1. Districts should implement new teacher evaluation systems by 2013-14 that have at least three levels: Highly effective (top 10 percent of teachers), effective and ineffective (bottom 10 percent of teachers).
  2. Districts should use the information from evaluations to inform decisions about compensation, hiring, placement, promotion, professional development, and retention.
  3. Probationary teachers should have three years of good evaluations to become non-probationary (i.e. tenured).
  4. Tenured teachers (non-probationary) with low evaluations for two years lose their tenure.
  5. Placement of teachers in schools should be based on the mutual consent of both the school and the teacher.

The changes to growth score component of SB 191 should focus on using good information, when it is available, to evaluate both teachers and systems.  When available, growth data should be used as part of teacher evaluations, along with measures that apply to all teachers such as peer and supervisor observation, as well as student and parental feedback.

At the same time, teacher growth scores should be used as a yardstick to judge evaluation system integrity.  Districts should publicly report the proportion of teachers in tested subjects who fall in the top and bottom 10 percent of growth scores statewide.  At the same time districts should also report the proportion of teachers in non-tested subjects that are evaluated to be highly effective and in-effective.

This public reporting will create pressure on districts for rigor and alignment of evaluation systems.  For example, if districts have relatively more highly effective teachers in non-tested subjects districts will need to explain why their most effective teachers are not in tested subjects.

Developing these evaluation systems is important.  However, this hard work is only one component of an even larger change.  Districts and schools need to embrace continuous improvement to be able to incorporate the information developed through evaluations.  We must push through these technical challenges and keep our eyes on the larger prize of improving instruction and student learning.

Popularity: 9% [?]

Proceed with caution on value-added

Monday, November 29th, 2010

For fairness in blogging, I feel compelled to write about the relatively new Brookings report “Evaluating Teachers: The Important Role of Value-Added” that takes a different perspective on using (value-added) student achievement data than the EPI report (“Problems with the use of student test scores to evaluate teachers”) released a few months ago.

Both studies were co-authored by a large group of distinguished researchers, so there is lots of food for thought in this debate. But let’s be clear that this is not just another case of “top researchers disagree on facts, so I can just ignore all of that confusing research and support what my gut says.”

The EPI researchers point out many flaws in the current technologies of using student value-added achievement data, and therefore recommend against its use in high-stakes decisions (like teacher rewards or firing).  The Brookings researchers agree that there are many flaws in value-added data, but ask the reasonable question “compared to what,” noting that other current methods of evaluating teachers are not very good either.

So it is more an issue of interpretation than what the facts are.

Perhaps the most interesting thing the Brookings researchers do is look at one problem with value added data, the low correlation for a given teacher across years (the same teacher compared in year 1, 2 and 3), with parallel correlations in other domains, like baseball batting averages, insurance salesperson rankings, mutual fund performance rankings, etc.

The relatively low correlation of year-to-year teacher’s rankings (0.3-0.4 range) has been cited by critics as a reason not to use value-added data, since it appears to have too much measurement “noise” to be as accurate as we would like (that is, in a disturbing large number of cases, the same teacher is ranked as “good” one year, and then “bad” the next year.)

The Brookings researchers suggest that the “noise” is not greater than what we see in these other domains and that this is an argument to use value-added for high-stakes decisions, even with its flaws. In some sense, this is argument by analogy, the accuracy of which we should examine.

First we have to think about whether we believe a teacher varies greatly in her performance over time.  Does a teacher have a “true type” (good, bad, average) and our challenge is to sample and measure that (pretty consistent) “type” correctly, or does the teacher actually vary greatly in her ability over time?  (see my earlier blog on this topic.)

If we think the teacher type is relatively “fixed” than the low correlation is a big problem, and it represents “noise” and our inability to sample well enough to find the “true type.”  Personally, that perspective makes more sense than a belief that teacher quality varies greatly from year to year.

Second, let’s examine the analogies more carefully.  Hitting a baseball thrown 60 feet at 90 miles per hour is a notoriously difficult and fickle skill – concentration and mental state do seem to matter a great deal – perhaps also so does the quality of pitching, non-random choices about which hitter to put up against which pitcher (lefty against righty, fastball versus curveball), etc.

It makes sense to me that top hitters one year might be less effective the next year, when they might be facing a divorce, or a contract year, or whatever.  Also, baseball players are well-known to be highly compensated, perhaps partly for the risky elements of their performance over time.

This seems to me a weak analogy to a teacher who has six hours and 180 days a year in a more self-controlled environment to perform “good, bad or average.”

Second, let’s look at mutual fund performance.  Here, from living 15 years in NYC, where the financial markets are eaten for breakfast, I am quite confident that the top investors in boom times (buy more tech stocks in 1997!) are also likely to be among the worst investors in downturns (buy more tech stocks in 2000!) or slow growth periods (when the top performers are cautious, diverse portfolio investors).

Academic studies of financial markets strongly suggest “random walks” and very little likelihood that we should expect high correlations across years in top performers, making this, again a poor analogy.

Third, insurance sales seem similar to financial investment to me – perhaps during good economic times a particular type of salesperson is more easily able to get families to spend money on insurance – it might require a different set of skills to be a high sales performer  in a recession.   Thus, the low correlation is in fact caused by factors outside the salespersons’ control (as with the financial markets, and probably partly in baseball too).

Their best analogy might seem to be the use of ACT/SAT scores for college admission despite a relatively low correlation with student GPAs (and the fact that no other measurable admissions factor has a higher correlation).  While I first found this argument  more compelling, it is severely flawed by the fact that they are now correlating two different things – actual student course achievement and a specific test – not the same thing over multiple years – a statistician would expect more “noise” in correlating two different things.

So it seems that the Brookings researchers can’t have this both ways.  Either you believe teacher type is fairly fixed, and then the low correlation is really a problem with our measurement technologies and a true problem with using the data.

Or you believe that teachers’ quality varies as much as professionals in these other fields, where it seems quite clear that much of this variation is a function of the external environment.   Many teachers have argued that their lack of control over their environment is a major barrier to how much they can move students achievement – the non-random assignment of teachers to groups of students, the variation in students from year to year, variations in other supports in the school, changing curricula, etc.

So where does this leave us in a discussion that is not just theoretical? After all, Colorado’s State Council for Educator Effectiveness is working, even as we blog, on implementing the SB 191 requirement that 50 percent of a teacher’s high-stakes evaluation be based upon student achievement.   Researchers agree that value-added has problems – they disagree about how severe those problems are, and whether focusing too much on these flaws, and not on problems with other forms of evaluation, makes the “perfect the enemy of the good.”

My belief is that there are many current problems with using value-added – some are fixable, with more and better tests, with better administration of tests (to avoid outright cheating and overly teaching to the tests), and with a clear understanding of the challenges created.

But other problems are not easily fixable, and perhaps we don’t want to fix them because they have value within schools – the non-random assignment of students to teachers within a school, the group nature of teaching, especially in higher grades, the difficulty assessing performance improvement in the arts, physical education, and other domains.

These issues suggest using value-added data very carefully, and as only one component in a meaningful, high-stakes teacher evaluation system.

Popularity: 13% [?]

Learning the right lessons

Monday, November 8th, 2010

I have been critical of efforts to jump too quickly into evaluating teachers largely based upon student test score data because our current available tests are poorly aligned for this task, infrequently given, and the statistical technology of tying (relatively small numbers of) specific students to specific teachers is more limited than most advocates seem to realize.

At the same time, I do think we need to evaluate teachers (and other education professionals) much better than we do now, and there is lots of room for improvement since our current systems are so poor (and there is some truth to guarding against making “the perfect the enemy of the good”).  And we need to be clearer about why better evaluation is important, both on the downside (perhaps firing or redeveloping poor performers) and on the upside (rewarding and learning from top performers).

Often, in these discussions, we make assertions about how things are done in the revered private sector and more specifically within business firms.   Often these assertions have no empirical basis.

So it is nice to see a thoughtful new report, from Julie Kowal and Emily Ayscue Hassel of Public Impact, that actually examines the literature on corporate employee evaluation systems, and what public education can learn from that domain.

While most of their conclusions align with common sense, and tend to promote mixed evaluation approaches, they do have some more interesting suggestions.  These include measuring more often, being ready to adjust measures when they don’t associate with organizational success, taking team outcomes seriously, and protecting evaluation from leniency and bias problems.

Popularity: 5% [?]

Elections resound from D.C. to…Kit Carson?

Thursday, November 4th, 2010

The recent election results and their likely effects on education reform, both at the state and national level, deserve a closer look. In Colorado it seems clear enough that marginal gains were made in favor of school choice and tenure/evaluation reform. Four of the five Democrat incumbents defeated by Republicans en route to their new majority — Dennis Apuan, Debbie Benefield, Sara Gagliardi and Dianne Primavera — were SB 191 opponents and significantly supported by teachers unions.

The state senate picture looks like more of a wash, even as races in two districts appear unresolved with ballots still being counted. Senate Majority Leader John Morse voted against the initial version of SB 191 but was one of a number of Democrats to come on board to accept the final House version. His challenger, Republican Owen Hill, likely would push education reform to the Right. Filling the shoes of pro-SB 191 Democrat Dan Gibbs offers a big divide on that vote for education reform. Conservative Republican Tim Leonard would be a staunch supporter of school choice and other reforms, while liberal Democrat Jeanne Nicholson seems to be more aligned with the unions than the retiring officeholder was. Meanwhile, SB 191 opponent Bruce Whitehead was defeated by supporter Ellen Roberts.

At the national level, with a Tea Party surge bringing a new majority and a new dynamic to the U.S. House, a big issue will be the ESEA reauthorization. I tend to agree with Fordham’s Mike Petrilli that the shift entails “less money, less reform” from D.C., as there will be a strong, new focus on trying to devolve federal power — perhaps halting some of the Obama-Duncan initiatives.

That brings me to an overlooked election result from Colorado’s Eastern Plains, that just may have national significance. On Tuesday, voters in the 100-student Kit Carson School District passed a $45,000 mill levy by a roughly 3-to-2 margin. In conservative Cheyenne County. During the big Tea Party wave. What gives? The money will be used to offset the small district’s position (rogue or avant garde, perhaps, depending on your point of view) to refuse federal Title One dollars.

Superintendent Gerald Keefe said receiving the roughly 2 percent of the general fund budget prevents the district from either having to dip into reserves or consolidate elementary classrooms. Still, it’s no small feat in a fiscally frugal rural area not known for backing many tax increases. Opting out of the federal strings was key. “This one is more on principle than anything,” Keefe said.

A few years ago, Keefe received confirmation that his was the only district nationwide that had taken the bold step of bucking federal dollars. To the best of his knowledge and mine, that still remains the case. Though I did find a northern Wisconsin school district that less boldly refused $45 (not $45,000) in allotted Education Jobs funding.

Next up for Kit Carson? A plan to opt out of Title Two dollars (currently used for professional development through BOCES), which would effectively liberate the district from No Child Left Behind’s “highly qualified” teacher requirements. Through either this approach or its Innovation proposal pending before the State Board of Education, Kit Carson is looking for greater flexibility to suit its local personnel needs.

It will be interesting to see how much of a trend Keefe’s school district propels, given the national mood and the latest election results. And it will be interesting to see how the newly configured Congress and Colorado legislature assume their roles in driving (or in standing back from) the next steps in education reform.

Popularity: 11% [?]

Reinventing the wheel (at a Goodyear plant)

Thursday, October 7th, 2010

Cross-posted from the Failing Schools blog

Yesterday, Amy Slowthower shared her well-earned frustrations about the expensive, tiresome process of developing a new teacher evaluation system for DPS. I feel her pain; there’s nary an aspect of school reform where there aren’t people taking forever and a day to come up with some new plan, and squandering millions of dollars as they dither.

What I don’t understand is why we feel it’s necessary for DPS to come up with a new teacher evaluation system in the first place.

When I first learned about Denver’s teacher evaluation system, I was actually very impressed. The current system identifies five performance standards on which teachers are assessed (instruction, assessment, curriculum & planning, learning environment, and professional responsibilities). Each standard has between three and five expectations to be met, suggested indicators for each expectation, and a rubric to gauge whether teachers are not meeting, developing, meeting, or exceeding expectations for each standard. There is a space for teachers to enter self-comments, and their reflections on how they address each professional standard.

There is also a summary of evidence journal administrators are supposed to gather and complete to justify the ratings they give in each area. Before the final evaluation is submitted, teachers and administrators are expected to meet to discuss any discrepancies in their respective perceptions of the teacher’s performance. The meeting also provides teachers an opportunity to offer more evidence (artifacts, student work, input from peers or parents) of what’s going on in their classroom, and receive feedback they can use to immediately improve their practice.

The system isn’t perfect, of course, though as I look over the forms and standards themselves, the only improvement that immediately comes to mind is to differentiate the Comprehensive Performance Rating at the end. Currently, the only options are “satisfactory” and ‘unsatisfactory,” though the information culled from the rest of the evaluation could easily support an overall rating of “not meeting,” “developing,” “meeting,” or “exceeding”, or whatever terms one likes.

The problem isn’t with the system itself, but the people using it.

For instance, on my most recent evaluation, I was meeting or exceeding expectations in all but one indicator under one standard. Yet the evaluator comments are overwhelmingly negative and demonstrably false. In one area where my performance exceeded expectations (assessment), the evaluator comments are similarly inaccurate, and easily disproven with even the quickest glance at my students’ data (yes, that data, that I try not to focus on, and everyone else claims to value so highly!).

Several friends of mine have described similar situations at that and other DPS schools. One DPS teacher even told me he was never observed, but the principal just wrote that he “deemed him to be a competent teacher” and didn’t bother to fill out the rest of the document. That evaluation was enough to grant this third-year teacher non-probationary (“tenured”) status. Some principals don’t bother to have the final meeting, others don’t give teachers their documents in enough time to respond…it’s a mess.

Why does this happen? Because overworked (and/or underhanded) principals know they can put just about anything on those forms and it won’t matter– no one is checking up on them. There is no meaningful oversight in this area– no close reading by the instructional superintendent to check for glaring omissions or inaccuracies, no timely investigation of teachers’ claims of misuse, no central office audits of the accuracy of what has been entered into the record, no nothing. Everything gets signed and kicked to the next location until it eventually lands in a file that’s rarely–if ever– opened. Whatever new system comes out of this current work won’t be any more effective if it suffers the same fate.

The higher-ups at DPS are probably as overwhelmed as the rank-and-file educators they’re supposed to support. But the path to advancement in such a place requires you to be more concerned about keeping up with the latest trendy school reform than stopping to seriously examine and address what’s going on out in the schools. (And why bother with that, anyway? There’s more money to be won from reinventing the wheel than fixing and using the tires you’ve got.)

So they’ve spent ten months, and a few million dollars, working on this. Spend ten years, and a billion dollars if you want. Or don’t. Who cares? Until the “look-busy-and-talk-smooth” culture at 900 Grant changes, it won’t make a whit of difference.

Popularity: 7% [?]

A stimulating dialogue on evaluation

Wednesday, September 8th, 2010

The New York Times is running a great discussion on teacher evaluation.  You can see it here.

The contributors cover a wide range of viewpoints.  Check out the various responses to the discussions.

Popularity: 3% [?]

Teacher evaluation is a sampling process

Thursday, September 2nd, 2010

I appreciate the recent blog discussion on student value-added methods for teacher evaluation, and I hope it can perhaps help bridge some of the “insider/outsider” rift on this topic.

My point in noting problems with value-added test score data is not to derail those efforts but to improve them.   Using these data in some manner is certainly better than having no information at all about how teachers influence student achievement.   But, we have to be careful, because the data have many flaws – they are necessarily only a sample of the “true” quality of a teacher, which is very hard to know, when we don’t observe them, in class for 180 days per year, six hours per day. That, of course, is impossible.

My concern (and a problem with the LA Times publication of such data) is that some people now think we have ironclad, precise data on teacher influence on student achievement, which we can now just plug in to evaluate the teacher.  The recent EPI report, and nearly all other recent research, point out very real problems with using these data, not just overly wonky anxiety.

When you sample, you always have implicit or explicit “confidence intervals” around the estimate.  A principal observing teacher quality via classroom activities is sampling.  If that principal only watches a particular teacher one time during the year, for 10 minutes, that is a very imprecise sample.   That teacher could be terrible most of the time, but the principal happened to catch her on part of a very good day, and (wrongly) writes down that she is an excellent teacher.  Obviously, the more observations a principal does, the more likely that the sample is an accurate reflection of the “true” teacher quality (Mike Miles in Harrison 2 has addressed this well, with 8-16 observations required per year).

(We know that flipping a coin is a 50/50 heads/tails probability, in a large sample.  But flip it 2 times, and the odds are that 25 percent of the time you get two tails, 25 percent both heads, and 50 percent a mix.   So, two random observations of a “good” teacher (“heads”) have a 25 percent chance of believing that she is not a good teacher (“tails”).)

Note that this also requires “random” observation – if the teacher knows the principal is coming to watch, she will obviously improve her performance at that time (a corollary to teaching to the test).

It is equally important to note that using test score data is also a sample of “real student learning.”  If the tests are valid, reliable and measure the learning we want students to have, they are better than tests that don’t have those characteristics (which fits most of our current tests).   But, one test a year (or two, or a few) are only samples of student learning.  If many of a teacher’s students are ill for March CSAPs, the sampling of learning won’t be terribly accurate.  The better the tests, the more tests we give, the more likely the sample is accurate, and in statistical terms, the smaller the confidence interval around the estimate, meaning we could say with more precision that teacher X is very good, and be correct about that.

Once-a-year CSAPs provide very imprecise estimates for individual teachers, which, along with other problems noted in other blog posts, should caution about high stakes use of these data.

Even with these caveats, the data can be used in some basic ways, probably helping to sort teachers into three categories.   I agree with Kevin Welner that we can think about using test score (and other) data to sort teachers into a small category of “excellent teachers” who seem to drive high student achievement over several years and also rank high on principal and/or peer evaluations.  At the other end, consistently low student achievement scores and low observational ratings should identify truly poor teachers.   The vast number of teachers will be in a middle ground, and trying to sort them more precisely will be going well beyond the ability of our sampling techniques.

Popularity: 11% [?]

The shortcomings of value-added modeling

Tuesday, August 31st, 2010

Hello EdNews readers. I’ll be checking in occasionally with blog entries here focused on new and worthwhile research. For this first blog, I want to point you to a research brief published on Sunday by the Economic Policy Institute.

The piece, with the dry but informative title of “Problems with the Use of Student Test Scores to Evaluate Teachers,” is authored by an extremely impressive collection of accomplished researchers. If you read nothing else about education this week, please read the three-page executive summary (then continue on and read the rest!).

Before discussing this research brief, I want to re-introduce myself. I teach school policy and law at the CU Boulder School of Education, where I also direct the Education and the Public Interest Center. In my blog entries here, I will try to point readers to useful resources on the EPIC website in addition to resources – like the new Economic Policy Institute brief – from other places.

The main point of the EPI research brief is straightforward: while value-added modeling (VAM) is a technical advancement that highlights student growth, the numbers generated are nevertheless too inaccurate to be used as a primary factor in making high-stakes decisions about teachers. That is, if someone tells you that a teacher is good or bad based on a VAM calculation, you are wise to take the judgment with a sizeable grain of salt. This is the same warning that I — with far less impressive credentials — issued a couple years ago, as did the National Academy of Sciences earlier this year.

The full EPI research brief does a great job explaining how and why high-stakes VAM policies cannot be supported by VAM itself. But there’s one quote and one illustration/study that I want to pull out of the brief, to hopefully entice you to read the entire thing.

First the quote:

“There is simply no shortcut to the identification and removal of ineffective teachers” (p. 20).

As I write these blog entries throughout the year, I could probably begin each one with, “There is simply no shortcut to…” In part, this reflects the complex nature of schooling, but it also reflects the sad state of policymaking, where politicians and others are so easily enticed by the quick fix.

The replacement of ineffective teachers with effective ones is unquestionably a worthwhile policy goal, but it’s much easier said than done. A policy intended to accomplish this goal would have to reliably (a) identify the ineffective teachers (without wrongly targeting the effective ones), and (b) identify and recruit effective replacement teachers. Also, the policy should accomplish this in a more cost-effective way than alternative possibilities (but given the problems with the first part of this puzzle, we’re not yet at the point where we should worry about such comparisons).

Now for the illustration. I’ll quote from page 2 of the executive summary:

One study found that across five large urban districts, among teachers who were ranked in the top 20% of effectiveness in the first year, fewer than a third were in that top group the next year, and another third moved all the way down to the bottom 40%. Another found that teachers’ effectiveness ratings in one year could only predict from 4% to 16% of the variation in such ratings in the following year. Thus, a teacher who appears to be very ineffective in one year might have a dramatically different result the following year. The same dramatic fluctuations were found for teachers ranked at the bottom in the first year of analysis. This runs counter to most people’s notions that the true quality of a teacher is likely to change very little over time ….

This is scary stuff, but only if used unwisely – only if policy makers give too much credence to the scores. VAM approaches do tell us something; a teacher (or school) whose VAM scores are consistently at the extreme high end of the distribution are very likely of higher quality than those consistently at the extreme low end. So here’s my alternative proposal: use VAM approaches as a first-stage, cost-effective tool that will help inform a more in-depth, second-stage quality analysis. A teacher or school at the bottom (e.g., the bottom 5 percent) in a state or district should be identified for classroom observations, principal evaluation, and other hands-on information-gathering that can lead to a determination of professional development needs or removal/turnover. Similarly, a teacher or school at the top might be identified for further study that might help us learn from successes. This approach has three major advantages:

  1. The ultimate evaluations of teachers and schools will not be made based on the test scores; they will be more thorough and reliable.
  2. The use of VAM here is supportable, since it is not being used to make fine-grained distinctions among teachers, but it does serve as a good tool that allows for a cost-effective use of hands-on evaluation tools.
  3. A teacher will not feel extreme pressure to teacher to the test, since his or her career would ultimately be determined by hands-on observations and other information, not by students’ test scores.

Sadly, the approaches being considered and implemented in Colorado and elsewhere rely far too much on test scores and VAM approaches. We are rushing toward a system of teacher evaluation that is sure to wrongly identify teachers as good or bad, and it will likely be years before policymakers realize and correct the mistake.

Popularity: 12% [?]

Tenure: An idea whose time has gone?

Monday, July 26th, 2010

Megan McArdle’s recent piece in The Atlantic makes this claim.  My favorite part was her response to the argument that it is tenure that allows professors to produce important research:

How about valuable scholarship?  Well, define valuable–in many liberal arts fields, the only possible consumer of the research in question is a handful of scholars in the same field.  That sort of research is valuable in the same way that children’s craft projects are priceless–to their mothers.  Basically, these people are supporting an expensive hobby with a sideline business certifying the ability of certain twenty-year-olds to write in complete sentences.

Another point is equally compelling: tenure is supposed to encourage professors to take risks.  But because the process of applying for and receiving tenure is highly political and consumes one’s early career, it often has the opposite effect: scholars early in their professions, when they are most likely to produce groundbreaking work, are far more risk-averse; by the time tenure is granted, a professor is more definitively committed to a specific academic trajectory with far less chance of groundbreaking research.

Lastly, McArdle points out that the process does not do much for the vast majority of tenure applicants who are not successful:

At the end of the process, most of the aspirants do not have tenure; they have dropped out, or been dropped, at some point along the way.  Meanwhile, the system has ripped up their lives in other ways.  They’ve invested their whole youth, and are back on the job market near entry level at an age when most of their peers have spent ten years building up marketable skills.  Many of them will have seen relationships ripped apart by the difficulties of finding not one, but two tenure-track jobs in the same area.  Others will have invested their early thirties in a college town with no other industry, forcing them to move elsewhere to restart both their careers and their social lives.  Or perhaps they string along adjuncting at near-poverty wages, unable to quite leave the academy that has abused them for so long.

The entire piece is well worth a read, but it also made me think: if the arguments for tenure are so fragile for higher education, why in the world is it a fixture in K-12 education?  (and commenters, please let’s not have a pedantic linguistic fight about “tenure” vs. “non-probationary” – duck, walk, quack, etc.)

The most cogent arguments for tenure in higher education almost uniformly do not apply to K-12.  McArdle, in her discussion of tenure, notes:

I’m sure it’s protected more than one scholar from getting fired after making stupid remarks to a class.  And we would all of us–not just academics–like to be immune from getting fired for making stupid remarks.

Tenure in higher ed can at least appeal – correctly or not — to the importance of of university research. It’s the rare argument that tenure improves undergraduate teaching.  K-12 has no such shield, and the claims that tenure improves student learning seem to me to be even more sparse.  I sure understand why K-12 teachers like the protections of tenure.  I’m just not sure its benefits accrue to anyone else within the system.

8/12 Update: A similar and expanded view in Slate

Popularity: 6% [?]

Educator Council doesn’t need to be interesting

Friday, July 23rd, 2010

On Wednesday I took a detour from the routine to drop in like a fly on the wall and watch the proceedings of the Governor’s Council on Educator Effectiveness. This sort of policy making below the radar often doesn’t get the attention of, say, a legislative hearing. But the detailed definitions and standards to be produced by the Council (and/or to the State Board next year) are a most crucial piece of the implementation of the nationally-acclaimed Senate Bill 191.

The work of the council is a long way from being done. To say that Wednesday resulted in some progress in crafting the definitions of an effective teacher and an effective principal is to make the proceedings sound about as interesting as they can. If you’re looking for interesting, might I recommend reading up on the governor’s race?

More noteworthy is the venue: It’s not every day I go hang out inside CEA headquarters. And despite rumors to the contrary, I wasn’t harassed or assailed (though union officials still gave no sign of adding me to their Christmas card list). To me, the most significant news that came out of this meeting was the fact that all future meetings will be held on neutral turf in the State Capitol. I probably wasn’t the only one to note the problem of gathering the Council on the home turf of a special interest group heavily invested in the debate. At the least, it creates an appearance of conflict.

Anyway, yes, it was somewhat painful to endure an extended morning discussion on the council’s own plans and processes, and schedules of future meetings. Though clearly the pressure rests on the council members themselves and the extent of work to be accomplished before their December and spring deadlines. May it be true that they are coming together, getting on track and moving the ball forward (how’s that for mixed metaphors?).

Can the council complete its charge on time with consensus and in the spirit of the legislation’s goals? I sure hope so, though doubts would grow if their work continues at the pace of this week’s enclave. With all the future meetings on tap, it’s worth paying attention to the outcomes, but I don’t expect to be consistently watching the sausage-making quite so closely from here on out. And not necessarily because its proceedings are less than riveting. Come on… do we really want the council to get interesting?

Popularity: 4% [?]

Colorado Health Foundation Walton Family Foundation Daniels fund Pitton Foundations Donnell-Kay Foundation