Jump to content

Pfinder

From Wikipedia, the free encyclopedia

Pfinder is a computer vision system which detects features in video images in order to recognize human figures and their movements and gestures. Pfinder was designed by Wren, et al.[1] of the MIT Media Laboratory in 1997. As described by its authors, Pfinder is a "real-time system for tracking people and interpreting their behavior". The system improves upon previous works by not only identifying the boundaries of a person in the image, but also analyzing the regions inside the boundaries and relating them to the known structure of the human body. As an example, Pfinder can track a person's head and hands, and can determine the pose of the body and recognize gestures.

Limitations

[edit]
  • Pfinder does not cope with multi-modal backgrounds, in which a histogram of the pixel intensity contains more than one distinct peak.[2]
  • While it can handle small or gradual changes in lighting, it does not react well to large, sudden lighting changes. When there are large lighting changes, the system mistakenly labels them as a part of the foreground, and therefore tries to incorporate them into the human figure model.[1]
  • Pfinder is not able to handle multiple people in the same image well. While the blobs representing each person would be detected, the system would attempt to analyze them as one distinct human figure.[1]

Applications

[edit]

See also

[edit]

References

[edit]
  1. ^ a b c d Wren, Christopher Richard; Ali Azarbayejani; Ali Azarbayejani; Alex Paul Pentland (July 1997). "Pfinder: Real-Time Tracking of the Human Body" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 19 (7): 780–785. doi:10.1109/34.598236. hdl:1721.1/10652. Archived from the original (PDF) on 9 June 2007. Retrieved 24 July 2012.
  2. ^ Piccardi, Massimo (15 April 2004). "Background subtraction techniques: a review" (PDF). University of Technology, Sydney. p. 13. Archived from the original (PDF) on 4 March 2016. Retrieved 24 July 2012. It does not cope with multimodal backgrounds