Book Read Free

Everybody Lies

Page 28

by Seth Stephens-Davidowitz


  Kadyrov, Akhmad, 227

  Kahneman, Daniel, 283

  Kane, Thomas, 255

  Katz, Lawrence, 243

  Kaufmann, Sarah, 236–37

  Kawachi, Ichiro, 266

  Kayak (website), 265

  Kennedy, John F., 170, 171, 227

  Kerry, John, 8, 244

  King John (Shakespeare), 89–90

  King, Martin Luther Jr., 132

  King, William Lyon Mackenzie (alias), 138–39

  Kinsey, Alfred, 113

  Kirkpatrick, David, 154

  Klapper, Daniel, 225

  Knight, Phil, 157

  Kodak, and pictures as data, 99

  Kohane, Isaac, 203–5

  Krueger, Alan B., 56, 238

  Ku Klux Klan, 12, 137

  Kubrick, Stanley, 190–91

  Kundera, Milan, 233

  language

  and digital revolution, 274, 279

  emphasis in, 94

  as key to understanding bias, 74–76

  and paying back loans, 259–60

  and traditional research methods, 274

  and U.S. as united or divided, 78–79

  See also words

  learning. See education

  Lemaire, Alain, 257–61

  Levitt, Steven, 36, 222, 254, 280, 281. See also Freakonomics

  liberals

  and origins of political preferences, 169–71

  and parents prejudice against children, 136

  and truth about the internet, 140, 141–45

  and words as data, 75–76, 93, 95–96

  library cards, and lying, 106

  life, as imitating art, 190–97

  life expectancy, 176–78

  Linden, Greg, 203

  listening, and dating, 82n

  loans, paying back, 257–61

  Los Angeles Times, and Obama speech about terrorism, 130

  lotteries, 229, 229n

  Luca, Michael, 265

  Lycos (search engine), 60

  lying

  and age, 108n

  and incentives, 108

  and jokes, 109

  to ourselves, 107–8, 109

  and polls, 107

  and pornography, 110

  prevalence of, 21, 105–12, 239

  and racism, 109

  reasons for, 106, 107, 108, 108n

  and reimaging data, 103

  and search information, 5–6, 12

  and sex, 112–28

  by Stephens-Davidowitz, 282n

  and surveys, 105–7, 108, 108n

  and taxes, 180

  and voting behavior, 106, 107, 109–10

  “white,” 107

  See also digital truth serum; truth; specific topic

  Ma-Kellams, Christine, 266

  Macon County, Alabama, successful/notable Americans from, 183, 186–87

  Malik, Tashfeen, 129–30

  Manchester University, and dimensionality study, 247–48

  Massachusetts Institute of Technology, Pantheon project of, 184–85

  Matthews, Dylan, 202–3

  McCaffrey, Ed, 196–97n

  McFarland, Daniel, 80

  McPherson, James, 79

  measurability, overemphasis on, 252–56

  “Measuring Economic Growth from Outer Space” (Henderson, Storygard, and Weil), 99–101

  media

  bias of, 22, 74–77, 93–97, 102–3

  and examples of Big Data searches, 22

  owners of, 96

  and truth about hate and prejudice, 130, 131

  and truth about the internet, 143

  and words as data, 74–77, 93–97

  See also specific organization

  Medicare, and doctors reimbursements, 230, 240

  medicine. See doctors; health

  Messing, Solomon, 144

  MetaCrawler (search engine), 60

  Mexicans, and truth about hate and prejudice, 129

  Michel, Jean-Baptiste, 76–77, 78–79

  Microsoft

  and Cambridge University study about IQ of Facebook users, 261

  Columbia University pancreatic cancer study and, 28–29, 30

  and typing errors by searchers, 48–50

  Milkman, Katherine L., 91–92

  Minority Report (movie), 266

  Minsky, Marvin, 273

  minutes, zooming in on, 190–97

  Moneyball, Oakland A’s profile in, 254, 255

  Moore, Julianne, 185

  Moskovitz, Dustin, 238–39

  movies

  and advertising, 224–25

  and crime, 193, 194–95, 273

  violent, 190–97, 273

  zooming in on, 190–97

  See also specific movie

  msnbc.com, 143

  murder

  and danger of empowered government, 266–67, 268–69

  See also violence

  Murdoch, Rupert, 96

  Murray, Patty, 256

  Muslims

  and danger of empowered governments, 266–67, 268–69

  and truth about hate and prejudice, 129–31, 162–63

  Nantz, Jim, 223

  National Center for Health Statistics, 181

  National Enquirer magazine, 150–51, 152

  national identity, 78–79

  natural experiments, 226–28, 229–30, 234–37, 239–40

  NBA. See basketball

  neighbors, and monetary windfalls, 229

  Netflix, 156–57, 203, 212

  Netzer, Oded, 257–61

  New England Patriots-Baltimore Ravens games, 221, 222–24

  New Jack City (movie), 191

  New York City, Rolling Stones song about, 278

  New York magazine, and A/B testing, 212

  New York Mets, 165–66, 167, 169, 171

  New York Post, and words as data, 96

  New York Times

  Clinton (Bill) search in, 61

  and IQDNA study results, 249

  and Obama speech about terrorism, 130

  Stephens-Davidowitz’s first column about sex in, 282

  Stormfront users and, 137, 140, 145

  and truth about internet, 145

  types of stories in, 92

  vaginal odors story in, 161

  and words as data, 95–96

  New York Times Company, and words as data, 95–96

  New Yorker magazine

  Duflo study in, 209

  and Stephens-Davidowitz’s doppelganger search, 202

  News Corporation, 96

  newslibrary.com, 95

  Nielsen surveys, 5

  Nietzsche, Friedrich, 268

  Nigeria, pregnancy in, 188, 189, 190

  “nigger”

  and hate and prejudice, 6, 7, 131–34, 244

  jokes, 6, 15, 132, 133, 134

  motivation for searches about, 6

  and Obama’s election, 7, 244

  and power of Big Data, 15

  prevalence of searches about, 6

  and Trump’s election, 14

  night light, and pictures as data, 100–101

  Nike, 157

  Nixon, Richard M., 170, 171

  numbers, obsessive infatuation with, 252–56

  Obama, Barack

  and A/B testing, 211–14

  campaign home page for, 212–14

  elections of 2008 and, 2, 6–7, 133, 134, 211–12

  elections of 2012 and, 8–9, 10, 133, 134, 211–12

  and racism in America, 2, 6–7, 8–9, 12, 134, 240, 243–44

  State of the Union (2014) speech of, 159–60

  and truth about hate and prejudice, 130–31, 133, 134, 162–63

  Ocala horse auction, 65–66, 67, 69

  Oedipal complex, Freud theory of, 50–51

  OkCupid (dating site), 139

  Olken, Benjamin A., 227, 228

  127 Hours (movie), 90, 91

  Optimal Decisions Group, 262

  Or, Flora, 266

  Ortiz, David “Big
Papi,” 197–200, 200n, 203

  “out-of-sample” tests, 250–51

  Page, Larry, 60, 61, 62, 103

  pancreatic cancer, Columbia University-Microsoft study of, 28–29

  Pandora, 203

  Pantheon project (Massachusetts Institute of Technology), 184–85

  parents/parenting

  and child abuse, 145–47, 149–50, 161

  and examples of Big Data searches, 22

  and prejudice against children, 134–36, 135n

  Parks, Rosa, 93, 94

  Parr, Ben, 153–54

  Pathak, Parag, 235–36

  PatientsLikeMe.com, 205

  patterns, and data science as intuitive, 27, 33

  Paul, Chris, 37

  paying back loans, 257–61

  PECOTA model, 199–200, 200n

  pedigrees

  of basketball players, 67

  of horses, 66–67, 69, 71

  pedometer, Chance emphasis on, 252–53

  penis

  and Freud’s theories, 46

  and phallic symbols in dreams, 46–47

  size of, 17, 19, 123–24, 124n, 127

  “penistrian,” 45, 46, 48, 50

  Pennsylvania State University, income of graduates of, 237–39

  Peysakhovich, Alex, 254

  phallic symbols, in dreams, 46–48

  Philadelphia Daily News, and words as data, 95

  Philippines, cigarette economy in, 102

  physical appearance

  and dating, 82, 120n

  and parents prejudice against children, 135–36

  and truth about sex, 120, 120n, 125–26, 127

  physics, as science, 272–73

  pictures, as data, 97–102, 103

  Pierson, Emma, 160n

  Piketty, Thomas, 283

  Pinky Pizwaanski (horse), 70

  pizza, information about, 77

  PlentyOfFish (dating site), 139

  Plomin, Robert, 249–50

  political science, and digital revolution, 244, 274

  politics

  and A/B testing, 211–14

  complexity of, 273

  and ignoring what people tell you, 157

  and origin of political preferences, 169–71

  and truth about the internet, 140–44

  and words as data, 95–97

  See also conservatives; Democrats; liberals; Republicans

  polls

  Google searches compared with, 9

  and lying, 107

  reliability of, 12

  See also specific poll or topic

  Pop-Tarts, 72

  Popp, Noah, 202

  Popper, Karl, 45, 272, 273

  PornHub (website), 14, 50–52, 54, 116, 120–22, 274

  pornography

  as addiction, 219

  and bias of social media, 151

  and breastfeeding, 19

  cartoon, 52

  child, 121

  and digital revolution, 279

  and gays, 114–15, 114n, 116, 117, 119

  honesty of data about, 53–54

  and incest, 50–52

  in India, 19

  and lying, 110

  popular videos on, 152

  popularity of, 53, 151

  and power of Big Data, 53

  search engines for, 61n

  and truth about sex, 114–15, 117

  unemployed and, 58, 59

  Posada, Jorge, 200

  poverty

  and life expectancy, 176–78

  and words as data, 93, 94

  See also income distribution

  predictions

  and data science as intuitive, 27

  and getting the numbers right, 74

  and what counts as data, 74

  and what vs. why it works, 71

  See also specific topic

  pregnancy, 20, 187–90

  prejudice

  implicit, 132–34

  of parents against children, 134–36, 135n

  subconscious, 134, 163

  truth about, 128–40, 162–63

  See also bias; hate; race/racism; Stormfront

  Premise, 101–2, 103

  price discrimination, 262–65

  prison conditions, and crime, 235

  privacy issues, and danger of empowered government, 267–70

  property rights, and words as data, 93, 94

  proquest.com, 95

  Prosper (lending site), 257

  Psy, “Gangnam Style” video of, 152

  psychics, 266

  psychology

  and digital revolution, 274, 277–78, 279

  as science, 273

  as soft science, 273

  and traditional research methods, 274

  Quantcast, 137

  questions

  asking the right, 21–22

  and dating, 82–83

  race/racism

  causes of, 18–19

  elections of 2008 and, 2, 6–7, 12, 133

  elections of 2012 and, 2–3, 8, 133

  elections of 2016 and, 8, 11, 12, 14, 133

  explicit, 133, 134

  and Harvard Crimson editorial about Zuckerberg, 155

  and lying, 109

  map of, 7–9

  and Obama, 2, 6–7, 8–9, 12, 133, 240, 243–44

  and predicting success in basketball, 35, 36–37

  and Republicans, 3, 7, 8

  Stephens-Davidowitz’s study of, 2–3, 6–7, 12, 14, 243–44

  and Trump, 8, 9, 11, 12, 14, 133

  and truth about hate and prejudice, 129–34, 162–63

  See also Muslims; “nigger”

  randomized controlled experiments

  and A/B testing, 209–21

  and causality, 208–9

  rape, 121–22, 190–91

  Rawlings, Craig, 80

  “rawtube” (porn site), 59

  Reagan, Andy, 88, 90, 91

  Reagan, Ronald, 227

  regression discontinuity, 234–36

  Reisinger, Joseph, 101–2, 103

  relationships, lasting, 31–33

  religion, and life expectancy, 177

  Renaissance (hedge fund), 246

  Republicans

  core principles of, 94

  and origins of political preferences, 170–71

  and racism, 3, 7, 8

  and words as data, 93–97

  See also specific person or election

  research

  and expansion of research methodology, 275–76

  See also specific researcher or research

  reviews, of businesses, 265

  “Rocket Tube” (gay porn site), 115

  Rolling Stones, 278

  Romney, Mitt, 10, 212

  Roseau County, Minnesota, successful/notable Americans from, 186, 187

  Runaway Bride (movie), 192, 195

  sabermetricians, 198–99

  San Bernardino, California, shooting in, 129–30

  Sands, Emily, 202

  science

  and Big Data, 273

  and experiments, 272–73

  real, 272–73

  at scale, 276

  soft, 273

  search engines

  differentiation of Google from other, 60–62

  for pornography, 61n

  reliability of, 60

  word-count, 71

  See also specific engine

  searchers, typing errors by, 48–50

  searches

  negative words used in, 128–29

  See also specific search

  “secrets about people,” 155–56

  Seder, Jeff, 63–66, 68–70, 71, 74, 155, 256

  segregation, 141–44. See also bias; discrimination; race/racism

  self-employed people, and taxes, 178–80

  sentiment analysis, 87–92, 247–48

  sex

  as addiction, 219

  and benefits of digital truth serum, 158–59, 161

  and ch
ildhood experiences, 50–52

  condoms and, 5, 122

  and digital revolution, 274, 279

  and dimensions of sexuality, 279

  during marriage, 5–6

  and fetishes, 120

  and Freud, 45–52

  Google searches about, 5–6, 51–52, 114, 115, 117, 118, 122–24, 126, 127–28

  and handling the truth, 158–59, 161

  and Harvard Crimson editorial about Zuckerberg, 155

  how much, 122–23, 124–25, 127

  in India, 19

  new information about, 19

  oral, 128

  and physical appearance, 120, 120n, 125–26, 127

  and power of Big Data, 53

  pregancy and having, 189

  Rolling Stones song about, 278

  and sex organs, 123–24

  Stephens-Davidowitz’s first New York Times column about, 282

  and traditional research methods, 274

  truth about, 5–6, 112–28, 114n, 117

  and typing errors, 48–50

  and women’s genitals, 126–27

  See also incest; penis; pornography; rape; vagina

  Shadow (app), 47

  Shakespeare, William, 89–90

  Shapiro, Jesse, 74–76, 93–97, 141–44, 235, 273

  “Shattered” (Rolling Stones song), 278

  shopping habits, predictions about, 71–74

  The Signal and the Noise (Silver), 254

  Silver, Nate, 10, 12–13, 133, 199, 200, 254, 255

  Simmons, Bill, 197–98

  Singapore, pregnancy in, 190

  Siroker, Dan, 211–12

  sleep

  and digital revolution, 279

  Jawbone and, 276–77

  and pregnancy, 189

  “Slutload,” 58

  small data, 255–56

  smiles, and pictures as data, 99

  Smith, Michael D., 224

  Snow, John, 275

  Sochi, Russia, gays in, 119

  social media

  bias of data from, 150–53

  doppelganger hunting on, 201–3

  and wives descriptions of husbands, 160–61, 160–61n

  See also specific site or topic

  social science, 272–74, 276, 279

  social security, and words as data, 93

  socioeconomic background

  and predicting success in basketball, 34–41

  See also pedigrees

  sociology, 273, 274

  Soltas, Evan, 130, 162, 266–67

  South Africa, pregnancy in, 189

  Southern Poverty Law Center, 137

  Spain, pregnancy in, 190

  Spartanburg Herald-Journal (South Carolina), and words as data, 96

  specialization, extreme, 186

  speed, for transmitting data, 56–59

  “Spider Solitaire,” 58

  Stephens-Davidowitz, Noah, 165–66, 165–66n, 169, 206, 263

  Stephens-Davidowitz, Seth

  ambitions of, 33

  lying by, 282n

  mate choice for, 25–26, 271

  motivations of, 2

  obsessiveness of, 282, 282n

  professional background of, 14

  and writing conclusions, 271–72, 279, 280–84

 

‹ Prev