A classic machine learning data set describing hypothetical samples from the Agaricus and Lepiota family.
mushrooms
A data frame with 7597 rows and 361 variables:
Class: p=poisonous, e=edible
Cap shape: bell=b, conical=c, convex=x, flat=f, knobbed=k, sunken=s
Cap surface: fibrous=f, grooves=g, scaly=y, smooth=s
Cap color: brown=n, buff=b, cinnamon=c, gray=g, green=r, pink=p, purple=u, red=e, white=w, yellow=y
Bruises: bruises=t, no=f
Odor: almond=a, anise=l, creosote=c, fishy=y, foul=f, musty=m, none=n, pungent=p, spicy=s
Gill attachment: attached=a, descending=d, free=f, notched=n
Gill spacing: close=c, crowded=w, distant=d
Gill size: broad=b, narrow=n
Gill color: black=k, brown=n, buff=b, chocolate=h, gray=g, green=r, orange=o, pink=p, purple=u, red=e, white=w, yellow=y
Stalk shape: enlarging=e, tapering=t
Stalk root: bulbous=b, club=c, cup=u, equal=e, rhizomorphs=z, rooted=r, missing=?
Stalk surface above ring: fibrous=f, scaly=y, silky=k, smooth=s
Stalk surface below ring: fibrous=f, scaly=y, silky=k, smooth=s
Stalk color above ring: brown=n, buff=b, cinnamon=c, gray=g, orange=o, pink=p, red=e, white=w, yellow=y
Stalk color below ring: brown=n, buff=b, cinnamon=c, gray=g, orange=o, pink=p, red=e, white=w, yellow=y
Veil type: partial=p, universal=u
Veil color: brown=n, orange=o, white=w, yellow=y
Ring number: none=n, one=o, two=t
Ring type: cobwebby=c, evanescent=e, flaring=f, large=l, none=n, pendant=p, sheathing=s, zone=z
Spore print color: black=k, brown=n, buff=b, chocolate=h, green=r, orange=o, purple=u, white=w, yellow=y
Population: abundant=a, clustered=c, numerous=n, scattered=s, several=v, solitary=y
Habitat: grasses=g, leaves=l, meadows=m, paths=p, urban=u, waste=w, woods=d
Data gathered from:
Mushroom records drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf
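
The one-letter codes listed above can be mapped back to readable labels with small lookup tables. The following is a minimal sketch in Python, not part of the data set itself: the table names (CLASS_CODES, ODOR_CODES, STALK_ROOT_CODES) and the decode helper are hypothetical, and the attribute each table belongs to is inferred from the values in the corresponding code list. The "?" code is treated as a missing value.

```python
# Lookup tables copied from three of the code lists above.
# The table names are illustrative assumptions, not identifiers from the data set.
CLASS_CODES = {"p": "poisonous", "e": "edible"}

ODOR_CODES = {
    "a": "almond", "l": "anise", "c": "creosote", "y": "fishy", "f": "foul",
    "m": "musty", "n": "none", "p": "pungent", "s": "spicy",
}

STALK_ROOT_CODES = {
    "b": "bulbous", "c": "club", "u": "cup", "e": "equal",
    "z": "rhizomorphs", "r": "rooted", "?": "missing",
}


def decode(code, table):
    """Return the label for a one-letter code, or None if it marks a missing value."""
    if code not in table:
        raise ValueError(f"unknown code: {code!r}")
    label = table[code]
    # "?" is documented as missing, so report it as None rather than a label.
    return None if label == "missing" else label


# A hypothetical record, using only the three attributes defined above.
record = {"class": "e", "odor": "l", "stalk_root": "?"}
decoded = {
    "class": decode(record["class"], CLASS_CODES),
    "odor": decode(record["odor"], ODOR_CODES),
    "stalk_root": decode(record["stalk_root"], STALK_ROOT_CODES),
}
print(decoded)
```

Note that the same letter can stand for different values in different lists (for example, "c" is creosote in one list and club in another), so each attribute needs its own table.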