Selective Brain Damage: Measuring the Disparate Impact of Model Compression

Sara Hooker, Aaron Courville, Yann Dauphin, Andrea Frome

What is lost when we prune deep neural networks?

Between infancy and adulthood, the number of synapses in our brain first multiplies and then falls. Synaptic pruning improves efficiency by removing redundant neurons and strengthening the synaptic connections that are most useful for the environment. Despite losing 50% of all synapses between ages two and ten, the brain continues to function. The phrase "use it or lose it" is frequently used to describe the environmental influence of the learning process on synaptic pruning; however, there is little scientific consensus on what exactly is lost.

In 1990, an influential paper was published titled Optimal Brain Damage. The paper was amongst the first to propose that deep neural networks could be pruned of "excess capacity" in a way analogous to biological synaptic pruning. In a deep neural network, a weight is pruned, or removed from the network, by setting its value to zero. Today there are many possible pruning methods to choose from, and pruned models likely power many of the algorithms on your phone.
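To make the mechanics concrete, here is a minimal sketch of magnitude pruning, one common criterion: the weights smallest in absolute value are set to zero until a target fraction of the layer is sparse. The function name and shapes are our own illustration, not the implementation used in this work.

```python
import numpy as np

def prune_by_magnitude(weights, sparsity):
    """Zero out the smallest-magnitude entries of `weights` until a
    `sparsity` fraction of entries are exactly zero."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)
    if k == 0:
        return weights.copy()
    # Threshold at the k-th smallest absolute value.
    threshold = np.partition(flat, k - 1)[k - 1]
    # Keep only weights strictly above the threshold in magnitude.
    return np.where(np.abs(weights) <= threshold, 0.0, weights)

w = np.random.randn(64, 64)
w_pruned = prune_by_magnitude(w, sparsity=0.9)
print(np.mean(w_pruned == 0))  # ≈ 0.9
```

In practice, pruning is typically applied gradually during training with the remaining weights fine-tuned, rather than in a single post-hoc step as sketched here.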

At face value, pruning appears to promise that you can (almost) have it all. State-of-the-art pruning methods remove the majority of the weights with minimal degradation to top-1 accuracy. These newly slimmed-down networks require less memory, consume less energy, and produce predictions faster. All of these attributes make pruned models ideal for deploying deep neural networks to resource-constrained environments.

Synaptic pruning removes redundant neurons and strengthens connections that are most useful for the environment. (Figure courtesy of Seeman, 1999)

However, the ability to prune networks with seemingly so little degradation to generalization performance is puzzling. The cost to top-1 accuracy appears minimal if it is spread uniformly across all classes, but what if the cost is concentrated in only a few classes? Are certain types of examples or classes disproportionately impacted by pruning?

An understanding of these trade-offs is critical when deep neural networks are used for sensitive tasks such as hiring, health care diagnostics or self-driving cars. For these tasks, the introduction of pruning may be at odds with fairness objectives to avoid disparate treatment of protected attributes and/or the need to guarantee a level of recall for certain classes. Pruning is already commonly used in these domains, often driven by the resource constraints of deploying models to mobile phones or embedded devices.

In this work, we propose a formal framework to identify the classes and images where there is a high level of disagreement or difference in generalization performance between pruned and non-pruned models. We find that certain examples, which we term pruning identified exemplars (PIEs), and classes are systematically more impacted by the introduction of sparsity.

The primary findings of our work can be summarized as follows:

1. Pruning would be better described as "selective brain damage." Pruning has a non-uniform impact across classes; a fraction of classes are disproportionately and systematically impacted by the introduction of sparsity.
2. The examples most impacted by pruning, which we term Pruning Identified Exemplars (PIEs), are more challenging for both pruned and non-pruned models to classify.
3. Pruning significantly reduces robustness to image corruptions and natural adversarial images.

PIE: Pruning Identified Exemplars

PIEs are images where the most frequent prediction differs between a population of independently trained pruned and non-pruned models. We focus on open source research datasets such as ImageNet and find that PIE images are more challenging for both pruned and non-pruned models. Restricting the test-set to a random sample of PIE images sharply degrades top-1 accuracy. Removing PIEs from the test-set improves top-1 accuracy for both pruned and non-pruned models.
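The identification step can be sketched as follows. This is our own minimal implementation with hypothetical array shapes, not the paper's pipeline: for each image, we compute the modal (most frequent) prediction across each model population and flag disagreements.

```python
import numpy as np

def modal_prediction(preds):
    """Most frequent predicted label per image across a model population.
    preds: int array of shape (num_models, num_images)."""
    return np.array([np.bincount(col).argmax() for col in preds.T])

def find_pies(preds_nonpruned, preds_pruned):
    """An image is a PIE when the modal prediction of the pruned
    population disagrees with that of the non-pruned population."""
    return modal_prediction(preds_nonpruned) != modal_prediction(preds_pruned)

# Toy example: 3 models per population, 3 images.
nonpruned = np.array([[3, 1, 2], [3, 1, 2], [3, 0, 2]])
pruned = np.array([[3, 1, 5], [3, 1, 5], [0, 1, 5]])
print(find_pies(nonpruned, pruned))  # only image 2 is a PIE
```

Note that a PIE is defined by disagreement between the two populations, not by correctness: neither modal prediction needs to match the ground truth label.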


PIE images are more challenging for both pruned and non-pruned models to classify. Pruning appears to cause deep neural networks to "forget" the examples where there is already a high level of predictive uncertainty.

Below is a sample of ImageNet PIEs in each category. The labels beneath each image are: 1) the ground truth ImageNet label, 2) the modal prediction of the baseline non-pruned models, 3) the modal (most frequent) prediction from a population of pruned ResNet-50 models.


abstract exemplars: a sample of PIEs where the class object is in an abstract form, such as a painting, drawing, or rendering in a different material.
fine-grained: a sample of PIEs where the image depicts an object that is semantically close to various other classes present in the dataset (e.g., rock crab and fiddler crab, cuirass and breastplate).
atypical exemplars: a sample of PIEs where the image would be considered by a human to be an unusual or outlier example of the distribution of images in a given category.
True Label: toilet tissue | Non-Pruned: bath towel | Pruned: great white shark
True Label: cauliflower | Non-Pruned: cauliflower | Pruned: artichoke
True Label: sombrero | Non-Pruned: cowboy hat | Pruned: dough
True Label: pop bottle | Non-Pruned: restaurant | Pruned: barber shop
True Label: bathtub | Non-Pruned: bathtub | Pruned: cucumber
True Label: toilet seat | Non-Pruned: toilet seat | Pruned: folding chair
True Label: plastic bag | Non-Pruned: gown | Pruned: plastic bag
True Label: espresso | Non-Pruned: espresso | Pruned: red wine
True Label: coffeepot | Non-Pruned: espresso maker | Pruned: coffeepot
True Label: cuirass | Non-Pruned: breastplate | Pruned: cuirass
True Label: cradle | Non-Pruned: bassinet | Pruned: cradle
True Label: valley | Non-Pruned: valley | Pruned: alp
True Label: cloak | Non-Pruned: gas mask | Pruned: breastplate
True Label: gas pump | Non-Pruned: gas pump | Pruned: traffic light
True Label: maze | Non-Pruned: maze | Pruned: crossword puzzle
True Label: beer bottle | Non-Pruned: beer bottle | Pruned: sunscreen
True Label: jack o lantern | Non-Pruned: jack o lantern | Pruned: lampshade
True Label: petri dish | Non-Pruned: espresso | Pruned: petri dish
True Label: limousine | Non-Pruned: bob sled | Pruned: snowplow
True Label: rocking chair | Non-Pruned: rocking chair | Pruned: barber chair
True Label: grey whale | Non-Pruned: grey whale | Pruned: killer whale
True Label: screen | Non-Pruned: screen | Pruned: television
True Label: christmas stocking | Non-Pruned: sock | Pruned: christmas stocking
True Label: breakwater | Non-Pruned: lakeside | Pruned: seashore

To better understand why PIEs are more sensitive to the loss of capacity, we conducted a limited human study (85 participants) and found that PIEs from the ImageNet test set are more likely to be mislabelled, depict multiple objects, or require fine-grained classification.

Over half of all PIE images were classified by human participants as either having an incorrect ground truth label or depicting multiple objects. The over-indexing of poorly structured data hints that the explosion in number of parameters for single image classification tasks like ImageNet may be solving a problem that is better addressed in the data cleaning pipeline.

PIEs over-index on data that is poorly structured for a single-label image classification task. For these images, predicting the ground truth label may be an incomplete measure of generalization to unseen data. For example, a pruned model that predicts suit instead of the true label of groom would still be considered accurate by most humans: the groom is wearing a suit, so both labels could be acceptable. However, this prediction is penalized by measures such as top-1 accuracy.

Below is a sample of ImageNet PIEs in each category. The labels beneath each image are: 1) the ground truth ImageNet label, 2) the modal prediction of the baseline non-pruned models, 3) the modal prediction from a population of pruned ResNet-50 models.


incorrect or inadequate ground truth: a sample of PIEs where the ground truth label for the image is incorrect, or where there is insufficient information for a human to predict the correct label.
multiple objects: a sample of PIEs where the image depicts multiple objects, so a human may consider several labels to be appropriate predictions (e.g., a desktop computer consisting of a screen, mouse and monitor; a barber chair in a barber shop; a wine bottle full of red wine).
frequently co-occurring labels: a sample of PIEs where multiple objects frequently occur together in the same image. In certain cases, such as projectile and missile, this is because both labels are acceptable descriptions of the same object.
True Label: bakery | Non-Pruned: french loaf | Pruned: bakery
True Label: dock | Non-Pruned: container ship | Pruned: dock
True Label: hammer | Non-Pruned: carpenter's kit | Pruned: hammer
True Label: piggy bank | Non-Pruned: mushroom | Pruned: jigsaw puzzle
True Label: barber chair | Non-Pruned: barber chair | Pruned: barbershop
True Label: groom | Non-Pruned: groom | Pruned: suit
True Label: mortarboard | Non-Pruned: academic gown | Pruned: mortarboard
True Label: paddle | Non-Pruned: paddle | Pruned: canoe
True Label: tub | Non-Pruned: cauldron | Pruned: wok
True Label: sleeping bag | Non-Pruned: apron | Pruned: bib
True Label: crash helmet | Non-Pruned: gas mask | Pruned: lens cap
True Label: polecat | Non-Pruned: black footed ferret | Pruned: malamute
True Label: guacamole | Non-Pruned: burrito | Pruned: plate
True Label: confectionary | Non-Pruned: packet | Pruned: grocery store
True Label: parallel bars | Non-Pruned: parallel bars | Pruned: horizontal bars
True Label: desktop computer | Non-Pruned: screen | Pruned: monitor
True Label: tennis ball | Non-Pruned: tennis ball | Pruned: racket
True Label: wine bottle | Non-Pruned: red wine | Pruned: wine bottle
True Label: projectile | Non-Pruned: missile | Pruned: projectile
True Label: corn | Non-Pruned: corn | Pruned: ear (of corn)
True Label: restaurant | Non-Pruned: meat loaf | Pruned: guacamole
True Label: envelope | Non-Pruned: dumbbell | Pruned: maraca
True Label: wool | Non-Pruned: pole | Pruned: wing
True Label: radio | Non-Pruned: radio | Pruned: oscilloscope

On real-world datasets, the stakes are often much higher than correctly classifying a paddle or guacamole. For sensitive tasks such as patient risk stratification or medical diagnosis, our results suggest caution should be exercised before deploying pruned models.

PIE provides one tool for becoming more familiar with the underlying data by surfacing to the human expert a far smaller subset of examples that the model finds challenging. This can be extremely valuable for human-in-the-loop decision making, where certain atypical examples are re-routed for human inspection, or for aiding interpretability as a case-based reasoning tool to explain model behavior.


Inspecting PIE images can help us understand the types of inputs the model finds most challenging. PIE images are far harder for a model to classify. Removing PIE images improves top-1 generalization performance beyond the baseline.



The average top-1 accuracy of a ResNet-50 deep neural network is far lower on a random sample of PIE ImageNet images (green bar) than on a random sample of images from the ImageNet test set (pink bar).
Removing PIE images benefits generalization: top-1 accuracy improves beyond baseline performance when the model is restricted to a random sample of non-PIE ImageNet images (teal).

What class categories are impacted by pruning?

ImageNet has 1000 different class categories, which include both everyday objects such as cassette player and more nuanced categories that refer to the texture of an object, such as velvet, or even types of people, such as groom. If the impact of pruning were uniform across all classes, we would expect the model accuracy on each class to shift by the same number of percentage points as the difference in top-1 accuracy between the pruned and non-pruned model.

This forms our null hypothesis, and for each class we must decide whether to reject it and accept the alternative: the change to class-level recall differs from the change to overall accuracy in a statistically significant way. This amounts to asking: did the class perform better or worse than expected given the overall change in top-1 accuracy after pruning?

Evaluating whether the difference between samples of mean-shifted class accuracy from pruned and non-pruned models is "real" can be thought of as determining whether two data samples are drawn from the same underlying distribution, which is the subject of a large body of goodness-of-fit literature.

To compare class-level performance between pruned and non-pruned models, we use a two-sample, two-tailed, independent Welch's t-test. We independently train a population of pruned and non-pruned models and apply the t-test to determine whether the means of the samples differ significantly. This methodology allows us to identify a subset of classes where model performance either remains relatively robust to the loss of model weights or is overly sensitive to the reduction in capacity.
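The test can be sketched as follows. This is a simplified implementation under our own assumptions about array shapes, not the exact experimental pipeline: each model's class recalls are shifted by that model's overall top-1 accuracy, so the null hypothesis is "the class shifted by the same amount as the model overall".

```python
import numpy as np
from scipy import stats

def class_impact(recall_nonpruned, recall_pruned,
                 top1_nonpruned, top1_pruned, alpha=0.05):
    """Per-class Welch's t-test between mean-shifted class recalls.

    recall_* : arrays of shape (num_models, num_classes), class recall
               for each independently trained model.
    top1_*   : arrays of shape (num_models,), overall top-1 accuracy.
    Returns a boolean array marking classes whose change in recall
    differs significantly from the overall change in accuracy.
    """
    # Mean-shift each model's class recalls by its own top-1 accuracy.
    shifted_np = recall_nonpruned - top1_nonpruned[:, None]
    shifted_p = recall_pruned - top1_pruned[:, None]
    # Two-sample, two-tailed Welch's t-test (unequal variances).
    _, p_values = stats.ttest_ind(shifted_np, shifted_p,
                                  axis=0, equal_var=False)
    return p_values < alpha
```

A class flagged by this test either degraded more than the model overall or proved unexpectedly robust; the sign of the mean difference between the two shifted samples distinguishes the two cases.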


We independently train a population of pruned and non-pruned models and apply the t-test to determine whether the means of the samples differ significantly. At all levels of pruning, some classes are impacted far more than others (classes where the relative change in performance is statistically significant are shown in pink, vs. grey for classes where it is not).

We plot both the absolute % change in class recall (grey and pink bars) and the normalized accuracy relative to change in overall top-1 accuracy caused by pruning (grey and green markers).




The directionality and magnitude of the impact of pruning are nuanced and surprising. Our results show that certain classes are relatively robust to the overall degradation experienced by the model, whereas others degrade in performance far more than the model itself. This amounts to selective brain damage, with performance on certain classes evidencing far more sensitivity to the removal of model capacity.

At every level of pruning, the classes that experience a significant relative decrease in accuracy are fewer than those that receive a relative boost; however, the magnitude of the decreases is larger than that of the gains (which pulls overall accuracy downwards). This tells us that the loss in generalization caused by pruning is far more concentrated than the relative gains, with fewer classes bearing the brunt of the degradation caused by weight removal.

At higher levels of pruning, more classes are impacted and the absolute percentage difference widens between the classes most and least impacted. Most real-world applications prune more than 50% of weights in order to realize the gains in memory and efficiency. When 90% of the weights are removed, the relative change to 582 out of 1000 ImageNet classes is statistically significant.

What does this mean for the use of pruned models?

Pruned models are widely used by many real world machine learning applications. Many of the algorithms on your phone are likely pruned or compressed in some way. Our results are surprising and suggest that a reliance on top-line metrics such as top-1 or top-5 test-set accuracy hides critical details in the ways that pruning impacts model generalization.

However, our methodology offers one way for humans to better understand the trade-offs incurred by pruning and gain intuition about what classes benefit the most from additional capacity. We believe this type of tooling is a valuable first step to help human experts understand the trade-offs incurred by pruning and surface challenging examples for human judgement.

We welcome additional discussion and code contributions on the topic of this work. A comprehensive introduction to the methodology, experimental framework and results can be found in our paper and open-source code. There is substantial ground we were not able to address within the scope of this work; underserved areas worthy of future consideration include evaluating the impact of pruning on additional domains such as language and audio, a consideration of different architectures, and a comparison of the relative trade-offs incurred by pruning with other popular compression techniques such as quantization.

Acknowledgments

A special thank you is due to James Wexler, Keren Gu-Lemberg and Prajit Ramachandran for some helpful suggestions about how to visualize and communicate our results in an interactive format. This article was in part prepared using the Google AI Pair template and style guide. The citation management for this article uses the template v1 of the Distill style script.

We thank the generosity of our peers and colleagues for valuable feedback on earlier versions of this work. In particular, we would like to acknowledge the valuable input of Jonas Kemp, Simon Kornblith, Julius Adebayo, Dumitru Erhan, Hugo Larochelle, Nicolas Papernot, Catherine Olsson, Cliff Young, Martin Wattenberg, Utku Evci, James Wexler, Trevor Gale, Melissa Fabros, Prajit Ramachandran, Pieter Kindermans, Moustapha Cisse, Erich Elsen and Nyalleng Moorosi.

We thank the institutional support and encouragement of Dan Nanas, Rita Ruiz, Sally Jesmonth and Alexander Popper.

Citation

@article{hooker2019selective,
    title={Selective Brain Damage: Measuring the Disparate Impact of Model Pruning},
    author={Sara Hooker and Aaron Courville and Yann Dauphin and Andrea Frome},
    year={2019},
    url={https://weightpruningdamage.github.io/},
    eprint={1911.05248},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}