Research
My main research area is Digital Media, specifically focused on organizing large collections of
images with associated text through the use of techniques from Natural Language Processing and
Computer Vision. Today billions of images with associated text are available in web pages,
captioned photographs from news sources, video with speech or closed captioning, and other media. In
order to organize, search and exploit these enormous collections we have developed methods that
combine information from both the visual and textual sources effectively. Past projects include:
automatically identifying people in news photographs, classifying images from the web, and
finding iconic images in consumer photo collections. I am also generally interested in bringing
together people and expertise from various areas of Digital Media including digital art, music,
and cultural studies.
Bio
I graduated with a Ph.D. from the Computer Science Department at UC Berkeley in the Spring of 2007 under the
advisorship of Professor David Forsyth and was a member of the Berkeley Computer Vision Group.
I spent 2007-2008 as a post-doc at
Yahoo! Research developing various digital media related projects
including the automatic annotation of consumer photographs. I am currently
an Assistant Professor at Stony Brook University.
Teaching
Spring 2012 - CSE/ISE 364 Advanced Multimedia
Spring 2012 - CSE 591 Recognizing People, Objects, and Actions
Fall 2011 - CSE 590 Computational Photography
Spring 2011 - CSE 595 Words & Pictures
Spring 2011 - CSE/ISE 364 Advanced Multimedia
Spring 2010 - CSE/ISE 364 Advanced Multimedia
Fall 2009 - CSE 591 Recognizing People, Objects, and Actions
Spring 2009 - CSE/ISE 364 Advanced Multimedia
Fall 2008 - CSE 690 Internet Vision
Students
Vicente Ordonez (PhD)
Kota Yamaguchi (PhD)
Hadi Kiapour (PhD)
Chaitanya Kommini (MS Independent Study)
Former Students
Deepak Venkatachalam (MS Independent Study)
Farheen Noorie (MS Independent Study)
Girish Kulkarni (MS) 2011 - Epic Systems
Debaleena Chattopadhyay (MS) 2011 - Indiana School of Informatics PhD
Sagnik Dhar (MS) 2010 - Honda Research
Visruth Premraj (MS) 2010 - Epic Systems
Erin Palmer (MS) 2009 - Factset
Jose Villa (MS) 2010
Piyush Kumat (MS Independent Study) Fall 2009
Current Funding
NSF Faculty Early Career Development (CAREER) Program: Award #1054133 - Toward a General Framework for Words & Pictures. Project Page
IIS Core: Award #1161876 - RI: Medium: Integrating Humans and Computers for Image and Video Understanding
Seeing Social: Exploiting Computer Vision in Online Communities. Google Faculty Research Award
SBU/BNL Seed Grant: "The Data Sensorium: Multi-Modal Explorations of Scientific Data". Personnel - Dan Weymouth, Kevin Yager, Tamara Berg, Margaret Schedel, Klaus Mueller, Dimitris Samaras, Tony Phillips, Rita Goldstein, Nelly Alia-Klein, Zabet Patterson.
NSF MRI-R2 grant: "Development of an Immersive Giga-pixel Display". Contributor as Senior Personnel.
Past Funding
Stony Brook FAHSS grant: "Encountering Data". Daniel Weymouth, Tamara Berg, Zabet Patterson, Margaret Schedel, John Lutterbie.
Stony Brook FAHSS grant: "Hybrid Geographies". Zabet Patterson, Christa Erickson, Margaret Schedel, Tamara Berg, Raiford Guins, Andrew Uroskie.
Publications
Parsing Clothing in Fashion Photographs
[pdf]
Kota Yamaguchi,
Hadi Kiapour,
Luis E. Ortiz,
Tamara L Berg
Computer Vision and Pattern Recognition, CVPR 2012.
Understanding and Predicting Importance in Images
[pdf]
Karl Stratos,
Aneesh Sood,
Alyssa Mensch,
Xufeng Han,
Margaret Mitchell,
Kota Yamaguchi,
Jesse Dodge,
Amit Goyal,
Hal Daume III,
Alex Berg,
Tamara L Berg
Computer Vision and Pattern Recognition, CVPR 2012.
Collective Generation of Natural Image Descriptions
[pdf]
Polina Kuznetsova,
Vicente Ordonez,
Alex Berg,
Tamara L Berg,
Yejin Choi
Association for Computational Linguistics. ACL 2012.
Detecting Visual Text
[pdf]
Jesse Dodge,
Amit Goyal,
Xufeng Han,
Alyssa Mensch,
Margaret Mitchell,
Karl Stratos,
Kota Yamaguchi,
Yejin Choi,
Hal Daume III,
Alex C Berg,
Tamara L Berg
North American Chapter of the Association for Computational Linguistics. NAACL 2012.
Midge: Generating Image Descriptions From Computer Vision Detections
[pdf]
Margaret Mitchell,
Jesse Dodge,
Amit Goyal,
Kota Yamaguchi,
Karl Stratos,
Xufeng Han,
Alyssa Mensch,
Alex Berg,
Tamara L. Berg,
Hal Daume III
European Chapter of the Association for Computational Linguistics, EACL 2012.
Interactive Music: Human Motion Initiated Music Generation
Using Skeletal Tracking By Kinect
[pdf]
Tamara L. Berg,
Debaleena Chattopadhyay,
Margaret Schedel,
Timothy Vallier
SEAMUS, 2012.
- Im2Text: Describing Images Using 1 Million Captioned Photographs
[pdf]
Vicente Ordonez,
Girish Kulkarni,
Tamara L. Berg
Neural Information Processing Systems (NIPS), 2011.
Dataset: SBU Captioned Photo Dataset
- Composing Simple Image Descriptions using Web-scale N-grams.
[pdf]
Siming Li,
Girish Kulkarni,
Tamara L. Berg,
Alexander C. Berg,
Yejin Choi
Computational Natural Language Learning (CoNLL), 2011.
- Iconizer: A Framework to Identify and Create Effective
Representations for Visual Information Encoding
[pdf]
Supriya Garg,
Tamara L. Berg,
Klaus Mueller
The 11th International Symposium on Smart Graphics (SG), 2011
- Baby Talk: Understanding and Generating Simple Image Descriptions
[pdf]
Girish Kulkarni,
Visruth Premraj,
Sagnik Dhar,
Siming Li,
Yejin Choi,
Alexander C. Berg,
Tamara L. Berg
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2011 (ORAL)
- High Level Describable Attributes for Predicting Aesthetics and Interestingness
[pdf]
Sagnik Dhar,
Vicente Ordonez,
Tamara L. Berg
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2011
- Who are you with and where are you going?
[pdf]
Kota Yamaguchi,
Alexander C. Berg,
Luis Ortiz,
Tamara L. Berg
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2011
- Can Computers Master the Art of Communication? An Excursion with a Focus on Visual Analytics
Klaus Mueller,
Supriya Garg,
Julia Nam,
Tamara L. Berg,
Kevin McDonnell.
IEEE Computer Graphics and Applications, May/June 2011.
- Automatic Attribute Discovery and Characterization from Noisy Web Data
[pdf]
Tamara L. Berg,
Alexander C. Berg,
Jonathan Shih
The European Conference on Computer Vision (ECCV) 2010.
Dataset: Attribute Discovery Dataset
- iWalk, A Tool for Interacting with Geo-Located Data Through Movement and Gesture
[pdf]
Visruth Premraj,
Margaret Schedel,
Tamara L. Berg
ACM Multimedia, Human Centered Multimedia Track (ACM MM) 2010.
- It's All About the Data
Tamara L. Berg, Alexander Sorokin, Gang Wang, David A. Forsyth, Derek Hoiem, Ali Farhadi, Ian Endres.
Proceedings of the IEEE, Special Issue on Internet Vision, August 2010, 98-8, 1434-1453.
- Finding Iconic Images
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg
The 2nd Internet Vision Workshop at Conference on Computer Vision and Pattern Recognition (CVPR) 2009.
- Words and Pictures: Categories, Modifiers, Depiction and Iconography
D.A. Forsyth, T.L. Berg, C. Alm, A. Farhadi, J. Hockenmaier, N. Loeff, G. Wang.
Object Categorization: Computer and Human Vision Perspectives. Cambridge University Press, 2009, in press. Sven Dickinson, Michael Tarr, Ales Leonardis, Bernt Schiele (eds)
- Names and Faces
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
Michael Maire,
Ryan White,
Yee Whye Teh,
Erik Learned-Miller,
David A. Forsyth
In Submission
- Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments [pdf]
Gary B. Huang, Marwan Mattar, Tamara Berg, and Erik Learned-Miller.
The Workshop on Faces in Real-Life Images at European Conference on Computer Vision (ECCV) 2008.
- Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments [pdf]
Gary B. Huang, Manu Ramesh, Tamara Berg, and Erik Learned-Miller.
University of Massachusetts, Amherst, Technical Report 07-49, October, 2007
- Exploiting Words and Pictures
[pdf]
Tamara L. Berg
U.C. Berkeley Ph.D. Thesis, May 2007
- Dataset Issues in Object Recognition
[pdf]
[ps]
J. Ponce, T. L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman,
Toward Category-Level Object Recognition,
Springer-Verlag Lecture Notes in Computer Science. J. Ponce, M. Hebert, C. Schmid and A. Zisserman (eds.), Feb 2007.
- Automatic Ranking of Iconic Images
[pdf]
[ps]
Tamara L. Berg,
David A. Forsyth
U.C. Berkeley Technical Report, Jan. 2007
- Names and Faces
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
Michael Maire,
Ryan White,
Yee Whye Teh,
Erik Learned-Miller,
David A. Forsyth
U.C. Berkeley Technical Report, Jan. 2007
- Animals on the Web
[pdf]
[ps]
Tamara L. Berg,
David A. Forsyth
Computer Vision and Pattern Recognition (CVPR) 2006
Demo: Animals on the Web
Dataset: Animals on the Web Dataset
- Shape Matching and Object Recognition using Low Distortion Correspondence
[pdf]
[ps]
[ppt]
Alexander C. Berg,
Tamara L. Berg,
Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2005
- Shape Matching and Object Recognition using Low Distortion Correspondence
[pdf]
[ps]
Alexander C. Berg,
Tamara L. Berg,
Jitendra Malik
U.C. Berkeley Technical Report, Dec. 2004
- Who's in the Picture?
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
David A. Forsyth
Neural Information Processing Systems (NIPS), 2004
Demo: Face Dictionary
Dataset: Faces In the Wild
Dataset: Labeled Faces In the Wild
- Names and Faces in the News
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
Michael Maire,
Ryan White,
Yee Whye Teh,
Erik Learned-Miller,
David A. Forsyth
Computer Vision and Pattern Recognition (CVPR), 2004
Alex Berg, my husband.
Arnold Miller, my dad.