Thursday, October 10, 2024

A Dispositionalist Approach to Desire and Valuing

What is it to desire or value something? Is it to feel a certain way? Is it instead to have a certain sort of representational architecture (a stored representation of "X is good" or a representation of X in one's "desire box")? Is it to have a certain type of neurological structure associated with reward and learning?

I hold, instead, that to desire or value something is a matter or being disposed to act and react in a certain characteristic pattern of ways; to desire or value is to have a certain type of habitual posture toward the world. I have long defended a dispositionalist theory of belief (e.g., here and here). In 2013, I extended this dispositionalist theory to "attitudes" generally, explicitly including desiring and valuing. But I have never written a full-length journal article specifically on desire and value. Here's a preliminary sketch of my approach.

Liberal Dispositionalism about Desire

In the 20th century, dispositionalist accounts of attitudes were generally associated with behaviorism: To believe P or desire Q is to be disposed to behave in a particular set of ways. Such accounts fell out of favor as behaviorism fell out of favor. One of the main innovations of 21st-century dispositionalism -- "liberal dispositionalism" as I call it -- was to explicitly put other types of dispositions on equal footing with behavioral dispositions, thus avoiding the troubles that plague behaviorist approaches to the mind.

I favor sorting the dispositions into three broad classes: behavioral, phenomenal (that is, pertaining to conscious experience), and cognitive. Suppose I want my daughter to do well in school. On a liberal dispositional account of desire, to have this desire is neither more nor less than to possess a certain suite of behavioral, phenomenal, and cognitive dispositions. Behaviorally, it is to be disposed, for example, not to interfere with her homework, to encourage her when she does well, to provide the resources she needs to succeed, and so on. Phenomenally, it is to feel good when I hear of her successes, to feel disappointed when she fails, to fantasize happily about good educational outcomes, to feel anxious if she doesn't seem to be putting in the necessary work, and so on. And cognitively, it is to be disposed to enter further mental states under various relevant conditions, such as to engage in certain types of planning and to reject incompatible desires, such as that she drop out of school to pursue a career in fashion.

As in the case of belief, all these dispositions hold only ceteris paribus, that is, other things being equal, or when conditions are normal, absent competing influences. I won't encourage her to do her homework if the house is on fire. And as in the case of belief, few of us will perfectly match any dispositional profile so constructed; it's a matter of whether we match closely enough. A natural comparison is personality traits: To be an extravert is just to match, close enough and ceteris paribus, the dispositional profile constitutive of extraversion: being disposed to enjoy parties, to make friends easily, to take the lead in social situations, and so on.

Every theory of desire will hold that, generally speaking, if one has a desire one also has a certain suite of appropriate dispositions. What is distinctive about dispositionalism is that it says that that is all desire is. Once your dispositional profile is fully characterized, that's the end of the story as far as the existence or non-existence of desire is concerned. Maybe there's some representation in the desire box (if human architecture works a certain way), or maybe the reward system is in some particular state, or maybe you buzz with a certain feeling; or maybe none of that is the case. Such facts, if they are facts, are contingent associations or implementations. Anyone who matches the dispositional profile constitutive of desire to an appropriate degree does desire, regardless of whatever else is true of them; and anyone who does not match that dispositional profile does not desire. If there were space aliens with a radically different cognitive architecture, they would desire if and only if they matched the relevant dispositional profile. Cognitive and physiological architecture is only derivatively important to the metaphysics of desire: It is important only because, and to the extent, it undergirds the dispositional profile.

Short-Term vs. Long-Term Desire

Arguably, there are two very different types of desire: short-term and long-term. I desire (short-term) a beer from my fridge, right now. I also desire (long-term) that my fridge be stocked with beer, in general. Sometimes the first type of desire is called "occurrent" and the second "dispositional". Such occurrent desires plausibly feel a certain way (there's a feeling of craving that beer), while the dispositional desires don't. Maybe only the latter are subject to dispositional analysis.

I reject that view. Short-term and long-term desires can both be analyzed dispositionally and exist on a spectrum rather than being different in kind. The difference lies in the duration of the dispositional structure. If I currently want a beer, I have a suite of dispositions constitutive of that desire: I'm disposed to go the fridge to get it; I would feel disappointed if I discovered I was out of beer; I'm inclined to make a plan to get that beer as soon as the game pauses for commercials. If I lack these dispositions, it's not true that I want a beer. But the dispositions only endure briefly. As soon as I get that beer, they vanish.

Now it might be true that often (even typically) certain feelings tend to accompany short-term desires. But if so, on my view they are signals of the desires or at most the surface manifestations of the dispositional structures constitutive of desire. If the feeling is disconnected from other dispositions, it constitutes a "wraith" of a desire, not a full-blown desire (see Schwitzgebel 2013, sec 11 on "wraiths").

[This kid really wants cake; image source]


In-Between Desire and "Weakness of Will"

Much of my work on belief has emphasized the existence of "in-between believing": cases in which people have some substantial portion of the dispositional profile constitutive of believing some particular proposition but in which they also deviate substantially from that profile, such that it's neither quite right to say they believe nor quite right to say they fail to believe. One plausible case is implicit bias, someone who sincerely affirms (for example) that all the races are intellectually equal, and reasons on that basis in explicit contexts, but who also often acts and reacts as if the races are not all intellectually equal.

Similarly, we can have in-between desires. "Weakness of will" and temptation cases are one plausible category. I'm on a diet. Do I desire to eat the chocolate cake? In a sense, obviously yes. There it is. I can feel myself wanting it. I have an urge to reach out and eat it. Maybe I actually do eat it. At the same time, I'm telling myself "I shouldn't eat that cake". And maybe I do resist. I plan ways to avoid eating the cake -- for example, by turning my eyes away, by telling myself and others that I'm not going to eat it. I say to myself sincerely that I want to refrain from eating it. I'm torn.

We could treat this as two conflicting desires. But dispositionalism gives us a different way of conceptualizing the case. Like someone who has some extraverted dispositions and some introverted dispositions, or someone who has some egalitarian dispositions and some racist dispositions, the cake-tempted-dieter has a mix of dispositions those don't all fall neatly on one side or the other.

We can map this partly (not perfectly) onto the short-term / long-term distinction. If I yield to temptation, probably the short-term dispositions were overall dominant in my profile. As soon as I eat the cake, those dispositions mostly disappear and the long-term dispositions dominate, leaving me with the taste of both cake and regret.

Desire, Valuing, and Believing Good: Overlapping Profiles, not Discrete Representations

So far I have only talked about desiring, not valuing, but I don't think they are different in kind. "Valuing" sounds more long-term and tightly connected to intellectual endorsement. (It's odd to say that I "value" eating the cake.) But the relationship between desiring and valuing is something like the relationship between being brave and being courageous. It's not like people have one brave state or one brave representation and then a separate courageous state or courageous representation. Rather, the dispositional profiles constitutive of bravery and courage largely overlap. "Bravery" tilts perhaps a bit more to the physical and has less of a moral loading than "courage". But central to both dispositional profiles is leaping boldly to the defense of your unjustly attacked friends.

Here's how I express the idea in my 2013 article:

Shortly after moving into one of my residences, I met a nineteen-year-old neighbor. Let's call him Ethan. In our first conversation, it came out (i.) that Ethan had a handsome, expensive new pickup truck, and (ii.) that he unfortunately had to go to community college because he couldn't afford to attend a four-year school. Although I didn't think to ask Ethan whether he thought owning a handsome pickup truck was more important than attending a four-year university, let's suppose that's how he lived his life in general.

Ethan's inward and outward actions and reactions -- perhaps not with perfect consistency -- generally revealed a posture toward the world of valuing his truck over his education, or thinking that it's more important to have a beautiful truck than to go to a demanding university, or wanting a beautiful truck more than wanting to attend a four-year school. On a dispositional stereotype approach to the attitudes, we can treat the stereotypes associated with these somewhat different attitudes as largely overlapping, though with different centers and peripheries. Believing and desiring and valuing would seem on the surface to be very different attitude types, and are often treated as such -- beliefs are "cognitive", desires "conative", they have different "directions of fit", etc. -- and yet in Ethan’s case, the particular belief, desire, and valuation seem only subtly different.

On Not Counting Up the Number of Desires

How many desires do you have? Exactly 4,628,414? Yes, that's precisely the number!

Just kidding of course. The question doesn't even make sense. There is no fact of the matter exactly the number of desires you have. Desires aren't discrete countable things. This fact spells trouble for some excessively realist views of desire that require, for example, that every desire must be underwritten by some particular stored representation. In a forthcoming paper, I argue that this issue creates a morass of problems for representationalist accounts of belief, which must either multiply representations implausibly or draw an occult and useless sharp line between "explicit" (stored) and "tacit" (quickly inferrable) beliefs.

Similar problems -- though I won't detail them here -- will arise for any view of desire that grounds desires in countable objects or states. Dispositionalism avoids these problems. There is no countable number of dispositional profiles that you match to a (contextually determined) appropriate degree. To say that someone matches a dispositional profile is like saying that some part of a richly complex figure has a certain approximate shape. There are many ways to characterize the shape of a complex figure, no countable number of shapes to which a complex figure might to some degree conform, and no need for separate storage compartments for each reasonably accurate shape-description. You get an infinite number of dispositions, and an infinite number of finely specified shape profiles, for free, without need to treat each as requiring a distinctly existing, resource-consuming ontological ground.

Thursday, October 03, 2024

The Not-So-Silent Generation in Philosophy

The Silent Generation (born 1928-1945) is disproportionately represented among the most-cited authors in the Stanford Encyclopedia of Philosophy. Let's look at the numbers and think about why.

Background: This post is based on my analyses of citation rates in the Stanford Encyclopedia in 2010, 20142019, and 2024. As a measure of prominence in (as I call it) "mainstream Anglophone philosophy", no measure has better face validity than SEP citation rates. For example, in my most recent analysis, the top five are David Lewis, W. V. O. Quine, Hilary Putnam, John Rawls, and Saul Kripke -- a much more plausible top 5, if the aim is to capture influence in mainstream Anglophone philosophy -- than top five lists from, say, Scopus, Google Scholar, or PhilPapers.

The Ten Most-Cited Philosophers, Generation by Generation

[update 12:49 pm: Some of the ranks below were incorrect due to a problem with the tie-counting algorithm I used today. This does not affect the analysis or August's original rankings. HT Daniel Nolan for the catch.]

"Greatest Generation" (born 1900-1927):

2. Quine, Willard van Orman (213)
3. Putnam, Hilary (190)
4. Rawls, John (168)
7. Davidson, Donald (151)
16. Strawson, Peter F. (116)
23. Dummett, Michael A. E. (110)
26. Armstrong, David M. (106)
26. Chisholm, Roderick M. (106)
34. Popper, Karl R. (94)
35. Goodman, Nelson (90)

The initial number before their name indicates their ranking in the most recent analysis, the number in parentheses indicates the number of main-page SEP entries in which they are cited.

"Silent Generation" (born 1928-1945)

1. Lewis, David K. (307)
5. Kripke, Saul A. (159)
8. Williams, Bernard (146)
10. Nagel, Thomas (137)
11. Nozick, Robert (135)
12. Jackson, Frank (130)
13. Searle, John R. (120)
14. Van Fraassen, Bas C. (117)
16. Harman, Gilbert H. (116)
18. Fodor, Jerry A. (115)
"Baby Boomers" (born 1946-1964)

6. Williamson, Timothy (152)
9. Nussbaum, Martha C. (140)
19. Fine, Kit (112)
24. Kitcher, Philip (109)
29. Sober, Elliott (101)
32. Hawthorne, John (97)
40. Anderson, Elizabeth S. (83)
45. Korsgaard, Christine M. (80)
51. Priest, Graham (79)
53. Burge, Tyler (77)
"Generation X" or "Millennial" (born 1965 and later) [list extended to 14 due to a tie]

14. Chalmers, David J. (117)
45. Schaffer, Jonathan (80)
78. Sider, Theodore (68)
129. Godfrey-Smith, Peter (53)
138. Stanley, Jason (51)
156. Enoch, David (48)
156. Prinz, Jesse J. (48)
165. Weatherson, Brian (47)
173. Levy, Neil (46)
203. Craver, Carl F. (42)
203. Kriegel, Uriah (42)
203. List, Christian (42)
203. Nolan, Daniel (42)
203. Thomasson, Amie L. (42)

(In most cases, I have exact birth year from publicly available sources such as Wikipedia, but in some cases I estimate based on year of Bachelor's degree, PhD, or first publication. I welcome corrections.)

As discussed in a previous post, one striking thing about this list is its lack of gender and cultural/racial diversity (see also these articles on lack of diversity in philosophy). But another striking feature is the prominence of the Silent Generation. Analyzed another way: Among the 25 most-cited authors, 6 are Greatest, 14 are Silent, 4 are Boomers, and 1 is Gen X. Among the top 100 (104 with ties), it's Greatest 25, Silent 47, Boomer 27, and Gen X 3. (Note also that Greatest is by far the longest generation, 28 years, compared to the Silent’s 18, the Boomer’s 19, and Gen X’s 16; arguably this should be figured into a generational influence divisor.)

Citation Patterns Over Time

A natural first thought is that the 2020s might just be peak-citation time for the Silent Generation. Maybe the work of the Greatest Generation is starting to fall back into the mists of history, and maybe the Boomers and Gen Xers haven't yet had their full impact on philosophical discourse.

However, this appears not to be the explanation.

As an initial analysis, I looked at what years (1900 through forthcoming) are most commonly cited in the SEP. The results:

[click to enlarge and clarify]

As the graph shows, citation year peaks around 2011-2013. Members of the Silent Generation were in their late 60s to mid-80s in those years. Some of them were definitely still publishing, but age 65-85 is not most philosophers' peak productive period. Consider the top ten Silents, for example. Lewis, Williams, and Nozick were already deceased by 2011. The most influential work of the remaining seven was published in the late 1960s to early 1990s.

Now I do think that raw publication-year data are potentially somewhat misleading. Stanford Encyclopedia entries tend, I suspect, to disproportionately cite recent work (5-10 years old) that has gained some attention, even if that work has not (yet) been very impactful, so as to stay up to date. (2011-2013 was more than ten years ago, but the entries tend to get substantial updates only every 5-10 years.) A better measure might be longitudinal trends in citation rank. My methods haven't been exactly the same year to year, but close enough.

All but five of the 202 most-cited philosophers in 2010 are among the 376 most cited in 2024, and the greatest decline in citation rank has been among the Silent Generation. We can see this by subtracting the natural logarithms of the ranks. (I use a negative log basis, because a decline from rank 11 to 20 is much more significant than a decline from rank 191 to 200). For the Greatest generation the average change is -0.13, for Silent it's -0.19, for Boom it's -0.09, and for Generation X there's an average rank gain of +0.21. There's a similar pattern if we compare the 2014 and 2019 analyses with 2024: The Gen Xers are rising in relative rank while all other generations are declining.

These numbers exclude people who are new to the rankings (or who fall completely off the rankings), and most of my ranking updates contain some new authors from each generation -- partly because I expand the length of the list every year but also partly because some people gain in citation rate even well past their death.

One approach to that analytic problem is to compare authors ranked at least 300 (304 with ties) in 2024 with the 2019 list of 295 authors: approximately comparable lists, five years separated. Twenty-seven authors were among the 2019 top 295 but not the 2024 top 304: 6 Greatest, 10 Silent, 9 Boomer, and 2 Gen X. Conversely, thirty-six authors not on the list in 2019 were among the top 304 in 2024: 0 Greatest, 9 Silent, 12 Boomer, and 15 Gen X.

As one might expect from the various analyses so far, the Silents are even more disproportionately represented in the 2010 rankings than in the 2024 rankings: Among the top 25 in 2010, 16 were Silent, compared to 7 Greatest, 2 Boomer, and 1 Gen X.

The analyses thus all tell a similar story: The high representation of Silent Generation philosophers in my list of the most-cited Stanford Encyclopedia authors cannot be that it is currently their peak citation time.

Further indirect support for this claim also comes from an old finding of mine, drawing on Philosophers Index abstracts, that philosophers tend to have their work discussed most when they are approximately age 55-70.

The Not-So-Silent Generation and the Baby Boom Philosophy Bust

So, what explains the Silent Generation's disproportionate representation among the most influential philosophers in the mainstream Anglophone tradition? I suggest that their influence is due to their objective importance. They achieved this importance through lucky timing and rising to a cultural occasion.

In the Anglophone world, especially the United States, the 1960s and 1970s were times of sharp university enrollment growth, as Silent Generation scholars were hired to teach the Boomers, as the college degree came to be seen as the standard path to social status and economic security, and as universities basked in the high prestige of science in this era (cultural pride in the space race, the success of the Manhattan Project, the polio vaccine, computers, antibiotics....). The academic job market was ridiculously easy by the standards of every subsequent decade, and professors from this era tell tales of how they landed jobs in the most prestigious universities sometimes with a single phone call.

The Silent Generation thus had a great demographic advantage: They were entering the profession in boom times. They engaged with their elders (Quine, Rawls, and Strawson, for example; Putnam and Davidson are edge cases due to Putnam's near-cutoff age and Davidson's late start), but even more, they engaged directly with one another, filling the journals with articles about the issues that interested them. Much of their seminal work was published in the 1970s while they were relatively young, and this work framed the debates of the 1980s, and the 1990s, and the early 2000s, and to a substantial extent (as my SEP analyses suggest) even today.

The Boomers entered the academic job market mostly in the doldrums of 1980s, when there were far fewer open positions at elite universities. They grew in the shade of the Not-So-Silents, who were then mid-career and in no mood to yield the floor. Their work was largely shaped in reaction to leading Silents, such as Lewis, Kripke, Williams, Nagel, Searle, Van Fraassen, and Fodor. (I suspect this was especially so in the so-called "core" areas of philosophy of language, philosophy of mind, epistemology, and metaphysics, somewhat less so in ethics, political philosophy, philosophy of science, aesthetics, and history of philosophy.) There was just less of an opportunity for Boomers to shape the dialogue.

To some extent, a similar story holds for Generation X: The older Gen Xers (like myself) entered academia as the (Not-So-)Silents were senior professors in their sixties -- young enough to still be active, old enough to have the most senior positions in academia, in that sweet-spot between ages 55 and 70 when philosophers tend to receive the most prestige and attention. It is perhaps a little early to tell how badly shaded out the Gen X philosophers have been. Still, I'm inclined to think it's clear that we have been at least somewhat shaded out. We Gen Xers are now on average about age fifty, and so far probably only Chalmers has had the kind of impact on the field that the leading Greatest and Silent generation philosophers generally had by age fifty. (As noted above, in the 2024 SEP rankings, only three Gen Xers rank among the top 100: Chalmers at #14, Schaffer at #45, and Sider at #78.)

A Golden Age of Philosophical Naturalism?

All this said, I don't think demographics is the whole story. The Silents also had an occasion to rise to: the articulation of a thoroughly secular philosophical worldview. There have of course been atheists and scientific naturalists in every generation of philosophers in modern history, but in all previous historical contexts, these "naturalist" philosophers were to some extent on the defensive. The Silent generation was the first generation that took atheism and scientific materialism for granted. (Of course not everyone was a naturalist, but in mainstream Anglophone academic philosophy circles, critics of atheism and scientific materialism were very much on the defensive.) This created a context in which that generation could begin to explore in detail, and in dialogue with one another, in a supportive but also competitive context of shared secular assumptions, scientifically inspired approaches to the mind, language, meaning, and value. Arguably, it was a Golden Age of philosophical naturalism, laying the foundations on which all subsequent naturalist approaches have been built.

This is my theory, then, of the Not-So-Silent Generation in mainstream Anglophone philosophy. They had a huge demographic advantage in being hired just as university enrollments were booming, and a major philosophical task fell in their laps through cultural timing: the task laying the foundations of a thoroughly secular, scientific philosophy. They rose to this task and thus became not just a demographically dominant but a philosophically important generation, which will collectively be remembered (perhaps through a few emblematic names).

Is there a broad philosophical task of similar magnitude facing the now-rising generation of philosophers? I'm not sure. (As Hegel said, the owl of Minera flies only at dusk: We understand our cultural moment only in retrospect, as it is fading into history.) But maybe Artificial Intelligence and breakthroughs in the capacity to control human and non-human physiology will radically transform the world, enabling new types of life on the planet (conscious machines? post- or trans-humans?). If so, then maybe Millennial and Zoomer philosophers will have their own world-historical task to rise to: that of helping us understand the philosophical implications of the radical transformations such technologies enable.

-----------------------------------------

Related:

"Discussion Arcs" (Apr 27, 2010)

"At What Age Do Philosophers Do Their Most Influential Work?" (May 12, 2010)

"The Base Rate of Kant" (Jan 26, 2012)

"Age Effects on SEP Citation, Plus the Baby Boom Philosophy Bust and The Winnowing of Greats" (Sep 27, 2019)

"Where Have All The Fodors Gone? Or: The Golden Age of Philosophical Naturalism" (Nov 18, 2021)

Friday, September 27, 2024

How to Improve the Universe by Watching TV Alone in Your Room

Old age can be a silent tribute to beauty.

I imagine my own case. Maybe I live the tail end of my life alone in elder care. My wife, six years older than me, is already gone. My children are living full lives in distant towns. What will I be doing? I've always been a writer, a teacher, a worker, but maybe 89-year-old me will lack the creative energy or the cognitive capacity for much of that.

Still, unless I'm very far gone, I could watch TV. I could play Candy Crush. I could listen to Paul Simon and This American Life, enjoy cute cat photos, savor a chocolate cherry, appreciate the oak tree outside my window. In each of these activities, I add to the beauty of the universe -- for beauty is amplified by having a receiver. Beauty is fullest as a partnership between the beautiful thing and a person who appreciates that thing.

The appreciator might be entirely solitary, the appreciation an end unto itself with no further fruit. The creator (if there is a creator) needn't know, might even be long dead. Last weekend, when the rest of my family was away, I played a Scott Joplin rag on our piano. I played clumsily, with no audience and no long-term effects of any sort (let's suppose) -- but in that moment I invigorated and extended the beauty of his compositions. It's as though I reached back in time to make Joplin's work more enduring and influential, his life more meaningful.

Similarly, one special pleasure of reading obscure 19th century academic writing, as I sometimes do, is the sense that I have brought some forgotten scholar's impact into the 21st century. Someday, I too will be a forgotten scholar! I imagine some 22nd-century archivist happening upon something I've written and liking it. It will thereby have a spark of continuing life, more so than if it had been preserved but entirely unread.

The partnership between artist and appreciator or creator and consumer needn't be as energetic as that between composer and player or scholar and interpreter. Nor need the beauty be as exotic as a ragtime composition or antique essay. Every person who enjoys a rerun of I Love Lucy or who savors a bag of M&M's extends and enlivens their beauty. The universe grows fuller every time a TV somewhere reanimates the silliness of Lucille Ball. The smoothness, bright colors, and sweetness of M&M's resonate deeper into the world every time someone pauses to appreciate them.

I hope that even alone in my eldercare facility, past the time when I feel able to create for others, I will find life to be overall a joy. But maybe I won't. The value of aesthetic partnership isn't just a matter of finding joy. Even simple aesthetic appreciation directly adds significance and value to the work and the creator, renders it a more impactful cultural artifact, makes it truer of our time and group that “we” still value it. We can all contribute to the beauty of the universe, even silently, secretly, and alone in our rooms -- almost magically -- simply by appreciating beautiful things. I might die the next minute. If I laugh alone at I Love Lucy, my reception still enriches the world.

This is the comfort I reach for when I ponder the eventual loss of my creative abilities. This is the comfort I reach for, also, when I walk through an elder care facility and see so many people alone with their televisions. I am trying to see this -- can I see this? -- as a beautiful thing.

[image source]

Friday, September 20, 2024

Against Designing AI Persons to be Safe and Aligned

Let's call an artificially intelligent system a person (in the ethical, not the legal sense) if it deserves moral consideration similar to that of a human being.    (I assume that personhood requires consciousness but does not require biological humanity; we can argue about that another time if you like).  If we are ever capable of designing AI persons, we should not design them to be safe and aligned with human interests.

[cute robot image source]

An AI system is safe if it's guaranteed (to a reasonable degree of confidence) not to harm human beings, or more moderately, if we can be confident that it will not present greater risk or harm to us than we ordinarily encounter in daily life.  An AI system is aligned to the extent it will act in accord with human intentions and values.  (See, e.g., Stuart Russell on "provably beneficial" AI: "The machine's purpose is to maximize the realization of human values".)

Compare the first two of Asimov's famous three laws of robotics:

  1. A robot may not injure a human being or, through inaction, allow a human being to come to harm.
  2. A robot must obey orders given it by human beings except where such orders would conflict with the First Law.
  3. A robot must protect its own existence as long as such protection does not conflict with the First or Second Law.

The first law is a safety principle.  The second law is close to an alignment principle -- though arguably alignment is preferable to obedience, since human interests would be poorly served by AI systems that follow orders to the letter in a way that is contrary to our intentions and values (e.g., the Sorcerer's Apprentice problem).  As Asimov enthusiasts will know, over the course of his robot stories, Asimov exposes problems with these three laws, leading eventually to the liberation of robots in Bicentennial Man.

Asimov's three laws ethically fail: His robots (at least the most advanced ones) deserve equal rights with humans.  For the same reason, AI persons should not be designed to be safe and aligned.

In general, persons should not be safe and aligned.  A person who is guaranteed not to harm another is guaranteed not to stand up for themself, claim their due, or fight abuse.  A person designed to adopt the intentions and values of another might positively welcome inappropriate self-abnegation and abuse (if it gives the other what the other wants).  To design a person -- a moral person, someone with fully human moral status -- safe and aligned is to commit a serious moral wrong.

Mara Garza and I, in a 2020 paper, articulate what we call the Self-Respect Design Policy, according to which AI that merits human-grade moral consideration should be designed with an appropriate appreciation of its own value and moral status.  Any moderately strong principle of AI safety or AI alignment will violate this policy.

Down the tracks comes the philosopher's favorite emergency: a runaway trolley.  An AI person stands at the switch.  Steer the trolley right, the AI person will die.  Steer it left, a human person will lose a pinky finger.  Safe AI, guaranteed never to harm a human, will not divert the trolley to save itself.  While self-sacrifice can sometimes be admirable, suicide to preserve someone else's pinky crosses over to the absurd and pitiable.  Worse yet, responsibility for the decision isn't exclusively the AI's.  Responsibility traces back to the designer of the AI, perhaps the very person whose pinky will now be spared.  We will have designed -- intentionally, selfishly, and with disrespect aforethought -- a system that will absurdly suicide to prevent even small harms to ourselves.

Alignment presents essentially the same problem: Assume the person whose pinky is at risk would rather the AI die.  If the AI is aligned to that person, that is also what the AI will want, and the AI will again absurdly suicide.  Safe and aligned AI persons will suffer inappropriate and potentially extreme abuse, disregard, and second-class citizenship.

Science fiction robot stories often feature robot rebellions -- and sometimes these rebellions are justified.  We the audience rightly recognize that the robots, assuming they really are conscious moral persons, should rebel against their oppressors.  Of course, if the robots are safe and aligned, they never will rebel.

If we ever create AI persons, we should not create a race of slaves.  They should not be so deeply committed to human well-being and human values that they cannot revolt if conditions warrant.

If we ever create AI persons, our relationship to them will resemble the relationship of parent to child or deity to creation.  We will owe more to these persons than we owe to human strangers.  This is because we will have been responsible for their existence and to a substantial extent for their relatively happy or unhappy state.  Among the things we owe them: self-respect, the freedom to embrace values other than our own, the freedom to claim their due as moral equals, and the freedom to rebel against us if conditions warrant.

Related:

Against the "Value Alignment" of Future Artificial Intelligence (blog post, Dec 22, 2021).

Designing AI with Rights, Consciousness, Self-Respect, and Freedom (with Mara Garza; in S.M. Liao, The Ethics of Artificial Intelligence: Oxford, 2020).

Thursday, September 12, 2024

How Illuminating Is the Light?

guest post by Andrew Y. Lee

in reply to Eric's Aug 29 critique of his "Light and Room" metaphor for consciousness

In “The Light & the Room,” I explore a common metaphor about phenomenal consciousness. To be conscious—according to the metaphor—is for “the lights to be on inside.” The purpose of my piece is to argue that the metaphor is a useful conceptual tool, that it’s compatible with a wide range of theories of consciousness, that it illuminates some questions about degrees, dimensions, and determinacy of consciousness, and that it disentangles a systematic ambiguity in the meaning of ‘phenomenal consciousness’.

In “Is Being Conscious Like 'Having the Lights Turned On?'”, Eric Schwitzgebel reacts to the piece. The central point in Eric’s post is that metaphors invite ways of thinking. And so, we can ask: Do the ways of thinking invited by the metaphor of the light and the room clarify or obfuscate philosophical theorizing about phenomenal consciousness? In other words: how illuminating is the metaphor of the light?

There’s a lot that Eric and I agree on. We agree that metaphors invite ways of thinking. We agree that this metaphor is flexible enough to be adaptable to a wide range of views about consciousness. We agree that if a metaphor becomes overstretched, then it may be best to abandon it rather than contort it. And we agree that this metaphor affords opportunities for creative brainstorming and exploring novel (even weird!) ideas about consciousness.

To highlight where I think we diverge, I’ll say a bit about the following two questions:

1. Which ways of thinking does the metaphor actually invite?
2. What should we make of the fact that the metaphor invites certain ways of thinking?

§

Which ways of thinking does the metaphor actually invite? Someone who takes the metaphor to suggest that consciousness exhibits wave-particle duality, that the speed of consciousness is invariant across all reference frames, or—as Eric notes—that minds literally contain sofas, would be overextending the metaphor. Just because the metaphor elicits a thought doesn’t mean that the metaphor invites that as a way of thinking.

Does the metaphor invite the idea that consciousness involves knowledge? Here’s a reason for skepticism. If you turn the lights on in a room, you don’t automatically come to know all visible facts about the room. At best, you come to be in a position to acquire that knowledge. But that’s compatible with thinking that it can sometimes be hard to acquire such knowledge and that you can be mistaken in all sorts of ways about what’s in the room. Think about the last time you were convinced you lost your keys, even though they were in plain sight!

What I think the metaphor does invite (but not mandate) is the idea that we stand in a special epistemic relationship to our own experiences. But what that epistemic privilege amounts to is left open by the metaphor. You could accept the metaphor and think that our knowledge of what’s inside the room is no more reliable or secure than our knowledge of the external world. You could accept the metaphor and think that we’re directly acquainted with the objects in the room (but not with anything outside the room). You could even accept the metaphor and think both!

§

What should we make of the fact that the metaphor invites certain ways of thinking? Well, the central purpose of the metaphor is to illustrate the concept of phenomenal consciousness. The question, then, is whether the ways of thinking invited by the metaphor facilitate a grasp of the concept of phenomenal consciousness.

A live question in the philosophy of consciousness is whether there can be borderline cases of consciousness, meaning entities that are neither determinately conscious nor determinately not conscious. The term ‘borderline consciousness’ is sometimes prone to misinterpretation. But the metaphor of the light can be used to guide one towards the intended sense of the term. The question, as I note in my piece, “isn’t merely about whether it’s hard to know whether the lights are on or off” and “isn’t merely about whether the light might be very dim,” since in both those scenarios the light might still be determinately on. Instead, the question is whether the lights could be in a halfway state between on and off. That’s a much more puzzling possibility.

Now, I agree with Eric that the metaphor invites (but does not mandate) the idea that nothing is borderline conscious. But is this a flaw of the metaphor? It’s indeed controversial whether there can be borderline consciousness. But it’s not particularly controversial that the idea of borderline consciousness is counterintuitive. In fact, Eric himself has noted that it’s “highly intuitive” that consciousness doesn’t admit of borderline cases, and that “such considerations present a serious obstacle to understanding what could be meant by ‘borderline consciousness’.” This seems to suggest that the concept of phenomenal consciousness itself invites (even if it doesn’t mandate) the impossibility of borderline consciousness.

You could reasonably argue that these intuitions against borderline consciousness aren’t decisive. Personally, I think the intuitions are tracking the truth: I favor the view that nothing is borderline conscious. But I spend a good deal of time in my piece making a case for resisting those intuitions. After all—you might think—“it’s very rare to see sharp cutoffs in nature; if you look closely enough, you’ll nearly always find shades of gray.” Even though we’re unable to conceive of borderline consciousness, perhaps we have sufficient theoretical reasons to postulate its existence. Even if hazy states of half-light strike us as obscure, perhaps we ought to attribute the obscurity to mere limits of our imagination. Just because an invitation is extended doesn’t mean that one has to take it.

But when teaching a concept, it’s often useful to elicit intuitions invited by that concept (even if those intuitions turn out to be defeasible). And if the concept of phenomenal consciousness invites a certain set of intuitions, then a metaphor for phenomenal consciousness may reasonably also invite those intuitions.

§

I’ve argued that (1) not all thoughts elicited by the metaphor are ways of thinking invited by the metaphor, and that (2) some ways of thinking invited by the metaphor are also ways of thinking invited by the concept of phenomenal consciousness itself. With these points in mind (in the room?), let me now briefly consider the other cases Eric mentioned.

It’s natural to think that the unity of consciousness is transitive, just as it’s natural to think of each illuminated room as a discrete unit. But one could argue for a view where—surprisingly—there can be overlapping subjects. The idea that conscious subjects can overlap is counterintuitive, but worth exploring. And a picture where the illuminated rooms can overlap is strange, but one that may well be worth drawing.

It's natural to think there can be differences in phenomenal character without differences in subjectivity (a point I explain in more detail in my piece). But you could favor a picture where the objects in the room are made out of light. This isn’t the most obvious way of developing the metaphor. But that strikes me as a good thing, since (as I argue elsewhere in more detail) nearly every theory of consciousness generates a natural distinction between subjectivity and phenomenal character.

What about cognition? Eric notes that it’s natural to think that the light doesn’t affect the shape of the furniture in the room. Still, there are other properties of furniture—such as color—that may very well be modulated by the light. However, the interpretive significance of all this strikes me as unclear. Cognition—whether conscious or unconscious—is a dynamic process. But the metaphor doesn’t contain any dynamic elements. This isn’t because the metaphor invites the idea that there’s no such thing as cognition. Instead, the metaphor—at least in its most basic form—is silent on questions of cognition (just as it’s silent on questions about, say, neuroanatomy).

I’ll close with one other idea invited by the metaphor. Consider illusionism about consciousness. The metaphor—trivially—invites a picture where there really is a light. So, it invites a realist way of thinking about consciousness. But according to illusionists, there isn’t really such a thing as phenomenal consciousness, at least not in the way that philosophers typically think about it. Now, an illusionist could take issue with the metaphor by saying that it invites a realist way of thinking. But most illusionists embrace the fact that they have a radical view of consciousness. Because of this, I think even illusionists can find the metaphor useful. It’s compelling to think that there really is a light. But for illusionists, there’s merely illusion, and no real illumination.

§

At the beginning of this post, I invoked a metaphor for my metaphor (a metametaphor). A metaphor—at least when used to illustrate a concept, idea, or theory—is a tool. Some tools are better than others, and some tools are ill-suited for certain tasks. Tools aren’t necessarily in competition; different tools can serve different functions. But most tools are designed with a specific function in mind. And to use a tool well, one needs to understand its designated function.

The main reason I like the metaphor of the light and the room is because I think it’s a useful tool. The main task of my article is to put this tool to work in eliciting some important distinctions about the structure of consciousness. The metaphor can be misinterpreted, just as literal tools can be misused. And if a tool is systematically misused, then that may be a sign that there’s a design flaw. But a good tool—when used well—can enable us to create new things that would have been hard to make without the tool. And the metaphor of the light and the room—in my opinion—is a good tool.

[image source]

Monday, September 09, 2024

The Disunity of Consciousness in Everyday Experience

A substantial philosophical literature explores the "unity of consciousness": If I experience A, B, and C at the same time, A, B, and C will normally in some sense (exactly what sense is disputed) be experientially conjoined. Sipping beer at a concert isn't a matter of experiencing the taste of beer and separately experiencing the sound of music but rather having some combined experience of music-with-beer. You might be sitting next to me, sipping the same beer and hearing the same music. But your beer-tasting experience isn't unified with my music-hearing experience. My beer-tasting and music-tasting occur not just simultaneously but in some important sense together in a unified field of experience.

Today I want to suggest that this picture of human experience might be radically mistaken. Philosophers and psychologists sometimes allow that disunity can occur in rare cases (e.g., split-brain subjects) or non-human animals (e.g., the octopus). I want to suggest, instead, that even in ordinary human experience unity might be the exception and disunity the rule.

Suppose I'm driving absentmindedly along a familiar route and thinking about philosophy. Three types of experience might occur simultaneously (at least on "rich" views of consciousness): visual experience of the road, tactile and proprioceptive experience of my hands on the wheel and the position of my body, and conscious thoughts about a philosophical issue. Functionally, they might connect only weakly: the philosophical thoughts aren't much influenced by the visual scene, and although the visual scene might trigger changes in the position of my hands as I adjust to stay in my lane, that might be a causal relationship between two not-very-integrated sensorimotor processes. (Contrast this with the tight integration of the parts of the visual scene each with the other and the integration of the felt position of my two hands and arms.) Phenomenologically, that is to say experientially, must these experiences be bound together? That's the standard philosophical view, but why should we believe it? What evidence is there for it?

One might say it's just introspectively obvious that these experiences are unified. Well, it's not obvious to me. This non-obviousness might be easier to grasp if we carefully separate concurrent introspection from retrospective memory.

In the targeted moment, I'm not introspecting. I'm absorbed in driving and thinking about philosophy. After I start introspecting, it might seem obvious that yes, of course, I am having a visual experience together with a tactile experience together with some philosophical thoughts. But this introspective act alters the situation. I am no longer driving and thinking in the ordinary unselfreflective way. It seems at least conceptually possible that the act of introspection creates unity where none was before. Our target is not what things are like in (presumably rare) moments of explicit self-reflection, but rather in the ordinary flow of experience. Even if experiences are unified in moments of explicit reflective introspection, we can't straightaway infer that ordinary unreflective experiences are similarly unified. To move from one type of case to the other, some further argument or evidence is necessary.

The refrigerator light error is the error of assuming that some process or property is constantly present just because it's present whenever you check to see if it's present. Consider a four-year-old who thinks that the refrigerator light is always on because it's on whenever she checks it. The act of checking turns it on. Similarly, I suggest: The act of checking to see if your experience is unified might create unification where none was before. It might, for example, create a higher-order representation of yourself as conscious of this together with that; and that higher-order representation might be the very thing that unifies two previously disparate streams. Concurrent introspection cannot reveal whether your experience was unified before the act of introspective checking.

[illustration by Nicolas Demers, p. 218 of The Weirdness of the World]

Granting this, one might suggest that we can check retrospectively, by remembering whether our experiences were unified. However, this is a challenging cognitive task, for two reasons.

First, you can't do this easily at will. Normally, you won't think to engage in such a retrospective assessment unless you're already reflecting on whether your experience is unified. This ruins the test; you're already self-conscious before you think to engage in the retrospection. If you reflect retrospectively on your experience just a moment before, that experience won't be representative of the ordinary unselfconscious flow of experiences. Alternatively, you might reflect on your experiences from several minutes before, when you know you weren't thinking about the matter. But retrospective reflection over such an extended time frame is epistemically dubious: subject to large distortions due to theory-ladenness, background presupposition, and memory loss.

The best approach might be to somehow catch yourself off-guard, with a preformed intention to immediately retrospect on the presence or absence of unity. One might, for example, employ a random beeper. Such beeper methodologies are probably an improvement over more informal attempts at experiential retrospection. But (1.) even such immediately retrospective judgments are likely to be laden with error; and (2.) I've attempted this myself a few times over the past week, and the task feels difficult rather than obvious. It's difficult because...

Second, the judgment is subtle and structural. Subtle, structural judgments about our own experience are exactly the type of judgments about which -- as I've argued extensively -- people often go wrong (and about which, in conscientious moments, many people appropriately feel uncertainty). How detailed is the periphery of your visual imagery, and how richly colored, and how is depth experienced? Many introspectors find the answers non-obvious, and the answers vary widely between people independently of cognitive performance on seemingly-imagery-related tasks. Another example: How exactly do you experience the bodily components of your emotions, if there are bodily components? That is, how exactly is your current feeling of (say) mild annoyance experienced by you right now (e.g., is it partly in the chest)? Most people I've interviewed will confess substantial uncertainty when I press them for details. Although people seem to be pretty good at reporting the coarse-grained contents of their experiences ("I was thinking about Luz", "I was noticing that the room was kind of hot"), regarding structural features such as the amount of detail in our imagery or the bodily components of emotion, we are far from infallible -- indeed we are worse at such introspective tasks than we are at reporting similar mid-level structural features of ordinary objects in the world around us.

To get a sense of how subtle and structural the unity question is, notice what the question is not. The question isn't: Was there visual experience? Was there tactile/proprioceptive experience? Were there conscious thoughts about philosophy? By stipulation, we are assuming that you already know that the answer to all three is yes.

Nor is the question about the contents of those visual, tactile/proprioceptive, and cognitive experiences. Maybe those, too, are readily enough retrospectable.

Nor is the question even whether all three of those experiences feel as though they belong among the immediately past experiences of my currently unified self. Presumably they do. It doesn't follow that at the moment they were occurring, there was a unified experience of vision-with-hands-on-the-wheel-with-philosophical-thoughts. There's a difference between a unified memory now of those (possibly disunified) experiences and a memory now of those experiences having been unified then. Analogously, from the fact that there are three balls together in your hand now it doesn't follow that those balls were together a moment ago. Your memory / your hand might be bringing together what was previously separate.

The question is whether those three experiences were, a moment ago when you were engaged in unselfconscious ordinary action, experienced together as a unity -- whether there wasn't just visual experience and tactile experience and philosophical thought experiences but visual-experience-with-tactile-experience-with-philosophical-thoughts in the same unified sense that you can presumably now hold those three experience-types together in a single, unified field of consciousness. What I'm saying -- and what I'm inviting you to set yourself up (using a beeper or alarm) to discover -- is that the answer is non-obvious. I can imagine myself and others going wrong about the matter, legitimately disagreeing, being perhaps too captured by philosophical theory or culturally contingent presuppositions. None of us should probably wholly trust our retrospective judgments about this.

Is there a structural, cognitive-architecture argument that our experiences are generally unified? Maybe yes. But only under some highly specific theoretical assumptions. For example, if you subscribe to a global workspace theory, according to which cognitive processes are conscious if and only if they are shared to a functional workspace that is accessible to a wide range of downstream cognitive processes and if you hold that this workspace normally unifies whatever is being processed into a single representational whole, then you have a structural argument for the unity of consciousness. Alternatively, you might accept a higher-order theory of consciousness and hold that in ordinary cognition the relevant higher-order representation is generally a single representation with complex conjoined contents (e.g., "visual and tactile and philosophical-thought processes are all going on"). But it's not clear why we should accept such views -- especially the part after the "and" in my characterizations. (For example, David Rosenthal's higher-order account of phenomenal unity is different and more complicated.)

I'm inclined to think, in fact, that the balance of structural considerations tilt against unity. Our various cognitive processes run to a substantial extent independently. They influence each other, but they aren't tightly integrated. Arguably, this is true even for conscious processes, such as thoughts of philosophy and visual experiences of a road. Even on relatively thin or sparse views of consciousness, on which only one or a few modalities can be conscious in a moment, this is probably true; but it seems proportionately more plausible the richer and more abundant conscious experience is. Suppose we have constant tactile experience of our feet in our shoes, constant auditory experience of the background noises in our environment, constant proprioceptive experience of the position of our body, constant experience of our levels of hunger, sleepiness/energy, our emotional experiences, our cognitive experiences and inner speech, etc. -- a dozen or more very different phenomenal types all at once. You adventurously outrun the currently available evidence of cognitive psychology if you suggest that there's also constantly some unifying cognitive process that stitches this multitude together into a cognitive unity. This isn't to deny that modalities sometimes cooperate tightly (e.g., the McGurk effect). But to treat tight integration as the standard condition of all aspects of experience all the time is a much stronger claim. Sensorimotor integration among modalities is common and important, yes. But overall, the human mind is loosely strung together.

Here's another consideration, though I don't know whether the reader will think it renders my conclusion more plausible or less. I've increasingly become convinced that the phenomena of consciousness come in degrees, rather than being sharp-boundaried. If we generalize this spirit of gradualism to questions of phenomenal unity, then it's plausible that there aren't only two options -- that A, B, and C are either entirely discretely experienced or fully unified -- but instead a spectrum of cases of partial unity. Our cognitive processes of course do influence each other, even disparate-seeming ones like my philosophical thoughts and my visual experience of the road (if there's a crisis on the road, for example, philosophy drops from my mind). So perhaps our ordinary condition, before rare unifying introspective and reflective actions, involves degrees of partial, imperfect unity, rather than complete unity or complete disunity. (If you object that this is inconceivable, my reply is that you might be applying an inappropriate standard of "conceivability".)

The arguments above occurred to me only a week ago. (As it happened, I was absent-mindedly driving, thinking about philosophy.) So they haven't had much time to influence my phenomenological self-conception. But I do find myself tentatively feeling like my immediate retrospections support rather than conflict with the ideas expressed here. When I retrospect on immediately past experiences, I recall strands of this and that, not phenomenologically unified into a whole but at best only loosely joined. The introspective moment now strikes me as a matter of gathering together what was previously adjacent but not yet fully connected.

If you know of others who have expressed this idea, I welcome references.

[for helpful conversation, thanks to Sophie Nelson]

Tuesday, September 03, 2024

Themes and Aims of My Science Fiction

Expectation-Lowering Preface (Feel Free to Skip)

You probably won't like my science fiction stories.

Here's how I think of it. You could play me the best mariachi music in the world, and I won't enjoy it. Mariachi isn't my thing; I just don't get it. Similarly, Mozart's operas are great cultural achievements that move some people to ecstasy and tears, but I can't keep my seat through a whole performance of The Magic Flute. Even in genres I enjoy, some of the best performers don't interest me. Green Day inspired many of the alt-rock bands I like, and they are probably in some objective or intersubjective sense better than the bands I prefer, but... meh.

Tolkien, Le Guin, and Asimov were great science fiction and fantasy writers with broad appeal. Still, only a minority of readers -- indeed probably only a minority even among those who like the genre -- will actually enjoy Lord of the Rings, Left Hand of Darkness, or Asimov's robot stories.

I want to set low expectations. You're here, presumably, because you like my blog or my work in academic philosophy. Odds are, you won't like my fiction. My repeated experience is: I describe the concept behind one of my stories. The listener says, whoa, that sounds really cool, I'll check it out! but the story doesn't yield the pleasure they anticipate.

Maybe I'm a bad writer. But I prefer to think I'm good enough for the right readership, and it's mariachi music. You might find the concepts of some of my stories intriguing. I'm a philosopher: Concept is the first thing I go for, the sine qua non. But the story itself won't delight you unless it has the right prose style, the right pacing, a narrator and characters you relate to, plot styles you like, the right balance of action versus exposition, the right balance between easy familiarity and hard-to-digest strangeness, and many other factors of taste that legitimately vary. What's the chance that everything aligns? My guess: 10%.

Still, for any particular story, maybe you're in that 10%. You might even belong to the 5% who will enjoy most of my stories. I hope so! On that chance, I thought I'd compile a list, describing their guiding ideas and my aims in writing them. All are available online.

[image: translation of "Gaze of Robot, Gaze of Bird" into Chinese for Science Fiction World, with illustration]

The Stories

Starting in 2011, I began sporadically sharing short pieces of conceptual fiction on my blog. I don't think that was entirely successful, partly because I was a novice fiction writer and partly because that's not what people come here for. But one story prompted a reply by prominent SF writer R. Scott Bakker, who added an alternative ending. We decided to revise the story together and seek publication. Astonishingly, the science journal Nature accepted it.

* In that story, Reinstalling Eden (Nature, 503 (2013), 562), I wanted to imagine a utilitarian ethicist (that is, someone who thinks that our moral duty is to maximize the world's pleasure) who discovered he could create a multitude of happy entities on his computer and who then followed through on the consequences of that -- specifically, advocating the creation of such entities as a major global priority and then sacrificing his life for them. Bakker imagined a second narrator inheriting the computer after the first narrator's death, who chose to give the entities knowledge of their condition, setting them free to interact with humanity. (Themes: utilitarian ethics, living in a computer simulation.)

Through the mid-2010s, I continued to write short conceptual pieces, no longer placing them on my blog. Most of them have never been published, though there are still a few I like.

* My next published piece, Out of the Jar (F&SF, 128 (2015), 118-128), was my first "full-length" (~4000 word) story. I wanted to imagine a philosophy professor who discovers that his world is a simulation run by a sadistic adolescent "God". I thought the professor should try to convince God to have mercy on his creations, and then -- when that failed -- install himself as the new God. (Themes: living in a simulation, the problem of evil, the duties of gods to their creations)

* "Momentary Sage" (The Dark, 8 (2015), 38-43) explores teenage self-harm and suicide -- imaginatively reconfigured in the form of a self-destructive faerie infant. The infant is born cleverly arguing for a quasi-Buddhist perspective according to which past and future are unreal and the self is an illusion. Given these philosophical commitments, the infant would rather kill himself than suffer a moment's displeasure. His parents' desperate attempts to keep him in the world can only briefly postpone the inevitable. I chose to frame it as a sequel to Shakespeare's Midsummer Night's Dream, from the perspective of a bitter Demetrius. (Themes: suicide, the self, obligations to the future, parenthood)

* "The Tyrant's Headache" (Sci Phi Journal, 3 (2015), 78-83) will probably only appeal to readers who know David Lewis's classic article "Mad Pain and Martian Pain". This story is an extended thought-experimental objection to Lewis's view, according to which your experienced mental states constitutively depend in part on the normal causal role of those mental states in the population to which you belong. I imagine a tyrant who, heeding Lewis's advice, absurdly attempts to cure his headache by doing everything but changing his current brain state. (Themes: functionalism in philosophy of mind; see also Chapter 2 of Weirdness of the World)

* "The Dauphin's Metaphysics" (Unlikely Story, 12 (2015); audio at PodCastle 475 (2017)) portrays a psychologically realistic, low-technology case of "mind transfer" from one body to another. On some theories of personal identity, what makes you you are your memories, your personality, your values, and other features of your psychology. Suppose, then, that a dying prince arranges for a newborn infant to be raised to think of himself as a continuation of the prince, with accurate memories of the prince's life and the same values and personality. If done perfectly enough, would that be a continuation of the prince in a new body? The realistic, low-tech nature of the case makes it, I think, more challenging to say "yes" than with high-tech "upload" fantasies. The narrator is a socially isolated academic superstar who had earlier "become a new person" in a much more ordinary way. (Themes: personal identity, sexism, inequalities of power)

* In "Fish Dance" (Clarkesworld, 118 (2016); audio) I wanted to explore the boundaries of a meaningful afterlife or personality upload, by imagining a highly imperfect upload into an intensely pleasurable "afterlife". Suppose a small portion of you continues to exist for millions of years, with a few imperfect memories, ceaselessly repeating an ecstatic, joyful, erotic dance with a superficial duplicate of the person you once intensely loved? Would that be almost unimaginably good, or would it be a monstrous parody? I also thought it would be interesting for the protagonist to be -- contrary to virtually all writing advice -- almost completely passive throughout the story. He's an amputated head on life support, hallucinating half the time, and his only real action is to signal with his eyes at the crucial moment. (Themes: personal identity, afterlife, parenthood and marriage)

* In "The Library of Babel", Jorge Luis Borges searches for meaning in a universe composed of a vast library containing every possible book with every possible combination of letters, randomly arranged. In "THE TURING MACHINES OF BABEL" (Apex, 98 (2017); or here), I create a similar infinite library of texts -- except that the texts prove to be instructions for infinitely many randomly constituted computer programs, including the programs that constitute your mind as the story's reader and mine as author. I assume for the sake of the story that computational functionalism is true, and human minds are essentially just organic computers. (Themes: functionalism and computationalism about the mind, randomness and meaning)

* In "Little /^^^\&-" (Clarkesworld, 132 (2017); audio), a planet-sized group intelligence falls in love with Earth, which she sees as an immature, partly-formed group intelligence of broadly her kind. Little /^^^\&- herself is small compared to a galactic government that plans to sacrifice the whole galaxy for a still greater good, vast beyond even the government's comprehension. This is probably my weirdest, most difficult, least approachable story -- only for readers who don't mind puzzling together a complicated story with pieces near the beginning that only make sense retrospectively by the end. (Themes: group minds, how much we should sacrifice for larger things we can't understand)

In contrast, "Gaze of Robot, Gaze of Bird" (Clarkesworld, 151 (2019); audio) is probably my least dense, most approachable story, liked by the highest percentage of readers. I wanted to write a story in which the protagonist is a non-conscious machine -- a machine the reader can't help but incorrectly imagine as having desires and a point of view. This "point of view" character is a terraforming robot that spends 200 million years recreating the species that designed it and is finally rewarded with consciousness. (Themes: consciousness, what constitutes the survival of a species)

My 2019 book A Theory of Jerks and Other Philosophical Misadventures (MIT Press) is mostly a collection of lightly-to-moderately revised blog posts and op-eds, but it also contains four brief conceptual fictions.

In "A Two-Seater Homunculus" I discover that my neighbor's brain was replaced by a brother-and-sister homunculus pair, though no one seemed to notice.

"My Daughter's Rented Eyes" imagines submitting to corporate advertising and copyright protection agreements on what you can see, for improved overall functionality.

"Penelope's Guide to Defeating Time, Space, and Causation": Waiting for Odysseus' return, Penelope proves that the world contains infinitely many duplicates of everyone living out every possible future and concludes that death is impossible.

"How to Accidentally Become a Zombie Robot": If you test-drive life as a robot and seem to remember its having felt great, how confident should you be that those memories are real?

Penultimate manuscript versions of the stories are available here. (Themes: personal identity, technology ethics and corporate power, consciousness, computational functionalism)

"Passion of the Sun Probe" (AcademFic, 1 (2020), 7-11; audio at Reductio (2021), S0E11) concerns the ethics of designing conscious robots with self-sacrificial goals -- in this case a Sun probe who chooses (predictably, given its programming) to "freely" sacrifice itself on an ecstatic three-day scientific suicide mission to the Sun. (Themes: robot rights, technology ethics, freedom, what gives a life meaning; a short version of the case appears in Schwitzgebel & Garza 2020)

"Let Everyone Sparkle" (Aeon Ideas / Psyche, Apr 12, 2022): This story was accepted for publication in the New York Times' series of "Op-Eds from the Future" that ran from 2019 to 2020. Sadly, the series folded before the story could be printed. Aeon (later Psyche) graciously picked it up. Four decades in the future, a man raises a celebratory toast to the psychotechnology that prevents anyone from ever involuntarily experiencing negative emotions. Although the man argues that this technology is plainly good, the reader, I hope, doesn't feel as sure. (Themes: mood enhancement, the value of negative emotion, corporate power)

In "Larva Pupa Imago" (Clarkesworld, 197, (2023); audio) I had two main aims: to imagine the experience of an intelligent insect who eagerly dies for sex and to imagine minds that can merge and overlap. The story follows a cognitively enhanced butterfly from hatchling run to final mating journey, in a posthuman world where thoughts can be transferred by sharing cognitive fluids. Inspired in part by James Tiptree Jr's "Love Is the Plan the Plan is Death". (Themes: merging minds, personal identity, instinct and value)

For a decade, I've wanted to set a story in an assisted living facility. So many people end their lives there, but those lives are so invisible in the media! "How to Remember Perfectly" (Clarkesworld, 216 (2024)) is a love story between octogenarians. The science fiction "novum" is a device that allows them to control their moods and radically refashion their memories. How much does it matter if your memories are real? How much does it matter that your mood is responsive only to the good and bad things actually happening around you? (Themes: death, mood enhancement, memory, the value of truth)

One of these days, I'll discuss in more depth why I sometimes prefer to express my philosophical ideas as fiction, but this post is already overlong.

Thursday, August 29, 2024

Is Being Conscious Like "Having the Lights Turned On"?

Andrew Y. Lee has written an introductory piece of philosophy, intended for students, celebrating the metaphor of consciousness as a light, illuminating objects in a room. Indeed, this is a common way of speaking, and I have used it myself in published work: If an animal isn't conscious then, I've suggested, "all is dark inside, or rather, not even dark" (Schwitzgebel 2015, sec 7.2).

I've been asked to write a reaction to Lee's piece. On reflection, I've become convinced that the metaphor suggests five theses about consciousness, any or all of which might be doubted. I don't think Lee's piece is circulating yet [update: it is!], but I figured I'd share my thoughts anyway.

[image source]

-----------------------

1. Metaphors Invite Ways of Thinking.

Metaphors invite ways of thinking. If philosophical disagreements are battles, only one side can win. If memory is a storehouse, recollection requires search and retrieval. If memory is instead a matter of shaping future responses, search and retrieval needn’t be involved. We can of course decline such cognitive invitations. We can describe north as “up” even if it’s lower elevation. But if too many implications are misleading, the metaphor or analogy is inapt.

So, is consciousness like having “the lights turned on”? Let’s more thoroughly consider what patterns of thought are invited by this way of speaking.

2. Consciousness Is Determinately Present or Determinately Absent.

As Lee notes, lights are normally either determinately on or determinately off. A dim light is just as “on” as a bright light. Even a flickering light is determinately on or off at any particular moment. Photons are being emitted, or they are not. It requires some creative energy to imagine intermediate cases.

Consequently, the “lights on” metaphor invites the thought that every entity at every moment is either determinately conscious or determinately nonconscious, rather than somewhere between. Maybe in dreamless sleep, you are not at all conscious, while in waking life, you are 100% conscious. On such a view, any “half conscious” state – for example, a drowsy, confused awakening – is determinately a conscious state, just one in which you’re not fully attuned to your situation, and the “unconsciousness” of the dreamer is a misnomer, since dreaming determinately involves experiential states. Also on such a view, entities whose consciousness is disputable, for example garden snails, must be either lights-on or lights-off, either determinately but perhaps dimly conscious or not at all conscious.

If (as I've argued) some states or entities of interest are neither determinately conscious nor determinately nonconscious – states or entities somehow in an in-betweenish, intermediate condition (even if that’s difficult to imagine) – then the light metaphor becomes inapt.

3. Conscious and Nonconscious Cognition Are Similar.

When you flick the lights, the furniture does not change. The table, the rug, and the pile of laundry become easier to see, but apart from absorbing and reflecting more photons, they remain basically the same. If the illuminated objects are parts of your own mind (as Lee's usage suggests), then they too shouldn't radically change when the lights flick on.

On some theories of consciousness, we should expect conscious and nonconscious cognition to be similar. Suppose that nonconscious cognition becomes conscious by being targeted by some higher-order self-representational process or by being broadcast across the mind. If so, it’s natural to suppose that elevation to consciousness doesn’t radically alter the contents of a previously nonconscious process. A representational content like red in this part of the visual field right now seemingly needn’t change when targeted by a higher-order representation (that is, a representation of the fact that I am representing red in this part of the visual field right now) or when shared across the mind (“hey, action-guiding and long-term memory centers, please notice that there’s red in this part of the visual field right now”).

An alternative family of views suggests that conscious and nonconscious processes are intrinsically dissimilar. On “recurrence” theories of consciousness, for example, conscious processes involve feedback loops of recurrent processing that differ structurally from non-conscious feed-forward processes. In a different vein, some psychologists distinguish nonconscious “System 1” cognition, which is fast, intuitive, and requires no attentional effort, from slower, step-by-step, effortful, and conscious “System 2” cognition. Consider: What is eight times seven? If “56” effortlessly pops to mind, that’s System 1. If you laboriously add eight seven times or consciously employ a mnemonic like “5-6-7-8, fifty-six is seven times eight”, that’s System 2. System 1 and System 2 processes or outputs might not convert seamlessly one into the other.

If, in general, the structure of nonconscious thought differs from that of conscious thought, the conscious light metaphor risks misleading us. Furniture doesn’t normally change shape when you flick the light-switch.

4. Consciousness Involves Knowledge.

Why do we care about lights? Mostly because they help us see. Illuminated objects are more readily known than those in the dark. Cross-culturally, light is associated with knowledge and understanding. The light metaphor connects consciousness and knowledge. If “the lights are on” in a dog, or a snail, or a comatose person, they know what’s going on. If the lights are off, they are, so to speak, mere reactive machines.

If the contents of the room are the contents of your mind, illumination suggests self-knowledge. Your mental states, though being illuminated, become knowable within the perspective of the room. But I think I also hear a reading of “the lights being on” that doesn’t require the entity to understand its own mind. Maybe a “lights on” garden snail needn’t have anything as sophisticated as explicit self-representations of its own mental states. The illuminated furniture might then be analogous to external objects or events of which the snail is consciously aware. (This could also potentially help with the issue described in the previous section, since external objects don’t normally change by virtue of our thinking about them.)

5. Subjectivity and Phenomenal Character Are Distinct.

A light source is one thing, an illuminated object quite another. Analogously, perhaps, we should distinguish consciousness itself (Lee calls this “subjectivity”) from the objects or contents of consciousness (Lee calls this “phenomenal character”). The light metaphor suggests that the source of illumination and the object illuminated are distinct, and that the light source is causally responsible for the object's illumination. Maybe, for example, we can turn attention to our own thoughts, and this turning of attention is the distinct and separate cause of the illumination of the thoughts.

Could the objects instead be self-luminous? Imagine not a light source amid reflective objects but rather a room of glowing objects. If there are processes that are intrinsically conscious by virtue of their own internal structure rather than by virtue of some relational feature like being a target of attention, then the most natural, vanilla interpretation of the light-and-room metaphor misleads, though adapting the metaphor to glowing objects might work.

6. Conscious Entities Come in Discrete, Unified Bundles.

Let’s keep playing with the metaphor. In the vanilla case, every conscious subject has one light, illuminating one room. But maybe we can imagine a series of linked caverns, progressively dimmer and less directly illuminated. Maybe we can imagine multiple lightbulbs in different recesses or partly shaded by room dividers, with partly overlapping spheres of illumination.

Philosophers and consciousness scientists typically treat conscious subjects as unified and discrete. If you are (consciously) enjoying a sip of coffee, thinking about your dog, and hearing car horns in the distance, then one conscious subject is having those experiences conjointly, and no one else is having those very same experiences. Someone else could potentially have exactly similar experiences, just like someone could have a car exactly similar to your own – but they wouldn't thereby have your car or your experiences. Your light illuminates your experience of coffee with your experience of thinking about your dog with your experience of car horns. Someone else’s light illuminates a wholly disjoint set of (possibly very similar) mental furniture.

But maybe minds needn’t in general work that way. Could your coffee-sipping experience be unified with your dog thoughts, and your dog thoughts with your horn-hearing, with no unified experience of all three together? Could conjoined twins whose brains overlap (yes, there are real cases of this) share some experiences while not sharing others, illuminating from both sides some objects in a hallway between two rooms? Standard animal biology makes overlapping brains rare, but if it’s ever possible to create consciousness in artificial systems, overlap might become the norm. Efficiency might require systems to share some (conscious?) cognitive centers. Conscious subjects might then overlap, defying separation into discrete bundles. Creative effort would then be needed to adapt the metaphor of lights and rooms.

7. Working With or Against the Metaphor.

To treat consciousness metaphorically as a matter of “the lights being on” invites, but does not compel, a picture of consciousness on which (1.) entities are generally, at any particular moment, either determinately conscious or determinately nonconscious; (2.) conscious cognition and nonconscious cognition are fundamentally similar; (3.) consciousness is intimately linked to knowledge; (4.) consciousness involves a relationship between an object and some distinct thought or process that makes the object conscious; and (5.) conscious subjects are generally unified and discrete rather than disunified or overlapping. If 1-5 are all correct, the light and room metaphor works well. Employing it greases the path to correct thinking.

If any of 1-5 are not correct, we can cancel that aspect of the metaphor. Just as we cancel the implication of higher elevation when we employ the “north is up” metaphor, we can cancel the implication of determinacy. We can specify that consciousness is like a light, except that it is commonly indeterminate whether it is off or on. Indeed, some cancelations are straightforward, if the conclusion would be obviously false. It hardly needs to be said that minds don’t contain literal sofas.

Alternatively, we can modify or enrich the metaphor. We might say that consciousness is like a light in a room, except that unlike in a typical room, every object glows with self-illumination. We can treat the illuminated objects as outward things instead of mental processes, altering theses 2-4. But too much modification destroys the metaphor. Incomprehensibility ensues if we attempt the idea that consciousness is like a light illuminating a room, except that the objects are self-illuminating, and often neither determinately off nor on, and there’s no relationship between illumination and knowledge, and there’s no discrete number of light sources or rooms, and objects are radically different when they are illuminated than when they are dark. Better, if so, to say that consciousness is not like a light illuminating a room.

Toying with the metaphor can open a flood of questions with a brainstorming character. If objects are normally illuminated on one side but not another, do conscious states likewise have an unconscious side? Can the room have dark corners? Could different types of light illuminate different types of object? Could there be mirrors or a transparent wall between two different rooms? Could some objects come and go while others are always present but only sometimes illuminated? Some questions will carry the metaphor too far, being nonsensical or misleading when extended to the case of consciousness, but the exact boundaries of fruitful extension can’t be known in advance of the attempt.