1 Star 0 Fork 0

changjy7628/awesome-computer-vision

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
该仓库未声明开源许可证文件(LICENSE),使用请关注具体项目描述及其代码上游依赖。
克隆/下载
贡献代码
同步代码
取消
提示: 由于 Git 不支持空文件夾,创建文件夹后会生成空的 .keep 文件
Loading...
README
CC0-1.0
# Awesome Computer Vision: [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome) A curated list of awesome computer vision resources, inspired by [awesome-php](https://github.com/ziadoz/awesome-php). For a list people in computer vision listed with their academic genealogy, please visit [here](https://github.com/jbhuang0604/awesome-computer-vision/blob/master/people.md) ## Contributing Please feel free to send me [pull requests](https://github.com/jbhuang0604/awesome-computer-vision/pulls) or email (jbhuang1@illinois.edu) to add links. ## Table of Contents - [Books](#books) - [Courses](#courses) - [Papers](#papers) - [Software](#software) - [Datasets](#datasets) - [Tutorials and Talks](#tutorials-and-talks) - [Resources for students](#resources-for-students) - [Blogs](#blogs) - [Links](#links) - [Songs](#songs) ## Books #### Computer Vision * [Computer Vision: Models, Learning, and Inference](http://www.computervisionmodels.com/) - Simon J. D. Prince 2012 * [Computer Vision: Theory and Application](http://szeliski.org/Book/) - Rick Szeliski 2010 * [Computer Vision: A Modern Approach (2nd edition)](http://www.amazon.com/Computer-Vision-Modern-Approach-2nd/dp/013608592X/ref=dp_ob_title_bk) - David Forsyth and Jean Ponce 2011 * [Multiple View Geometry in Computer Vision](http://www.robots.ox.ac.uk/~vgg/hzbook/) - Richard Hartley and Andrew Zisserman 2004 * [Computer Vision](http://www.amazon.com/Computer-Vision-Linda-G-Shapiro/dp/0130307963) - Linda G. Shapiro 2001 * [Vision Science: Photons to Phenomenology](http://www.amazon.com/Vision-Science-Phenomenology-Stephen-Palmer/dp/0262161834/) - Stephen E. Palmer 1999 * [Visual Object Recognition synthesis lecture](http://www.morganclaypool.com/doi/abs/10.2200/S00332ED1V01Y201103AIM011) - Kristen Grauman and Bastian Leibe 2011 * [Computer Vision for Visual Effects](http://cvfxbook.com/) - Richard J. Radke, 2012 * [High dynamic range imaging: acquisition, display, and image-based lighting](http://www.amazon.com/High-Dynamic-Range-Imaging-Second/dp/012374914X) - Reinhard, E., Heidrich, W., Debevec, P., Pattanaik, S., Ward, G., Myszkowski, K 2010 * [Numerical Algorithms: Methods for Computer Vision, Machine Learning, and Graphics](https://people.csail.mit.edu/jsolomon/share/book/numerical_book.pdf) - Justin Solomon 2015 #### OpenCV Programming * [Learning OpenCV: Computer Vision with the OpenCV Library](http://www.amazon.com/Learning-OpenCV-Computer-Vision-Library/dp/0596516134) - Gary Bradski and Adrian Kaehler * [Practical Python and OpenCV](https://www.pyimagesearch.com/practical-python-opencv/) - Adrian Rosebrock * [OpenCV Essentials](http://www.amazon.com/OpenCV-Essentials-Oscar-Deniz-Suarez/dp/1783984244/ref=sr_1_1?s=books&ie=UTF8&qid=1424594237&sr=1-1&keywords=opencv+essentials#) - Oscar Deniz Suarez, Mª del Milagro Fernandez Carrobles, Noelia Vallez Enano, Gloria Bueno Garcia, Ismael Serrano Gracia #### Machine Learning * [Pattern Recognition and Machine Learning](http://research.microsoft.com/en-us/um/people/cmbishop/prml/index.htm) - Christopher M. Bishop 2007 * [Neural Networks for Pattern Recognition](http://www.engineering.upm.ro/master-ie/sacpi/mat_did/info068/docum/Neural%20Networks%20for%20Pattern%20Recognition.pdf) - Christopher M. Bishop 1995 * [Probabilistic Graphical Models: Principles and Techniques](http://pgm.stanford.edu/) - Daphne Koller and Nir Friedman 2009 * [Pattern Classification](http://www.amazon.com/Pattern-Classification-2nd-Richard-Duda/dp/0471056693) - Peter E. Hart, David G. Stork, and Richard O. Duda 2000 * [Machine Learning](http://www.amazon.com/Machine-Learning-Tom-M-Mitchell/dp/0070428077/) - Tom M. Mitchell 1997 * [Gaussian processes for machine learning](http://www.gaussianprocess.org/gpml/) - Carl Edward Rasmussen and Christopher K. I. Williams 2005 * [Learning From Data](https://work.caltech.edu/telecourse.html)- Yaser S. Abu-Mostafa, Malik Magdon-Ismail and Hsuan-Tien Lin 2012 * [Neural Networks and Deep Learning](http://neuralnetworksanddeeplearning.com/) - Michael Nielsen 2014 * [Bayesian Reasoning and Machine Learning](http://www.cs.ucl.ac.uk/staff/d.barber/brml/) - David Barber, Cambridge University Press, 2012 #### Fundamentals * [Linear Algebra and Its Applications](http://www.amazon.com/Linear-Algebra-Its-Applications-4th/dp/0030105676/ref=sr_1_4?ie=UTF8&qid=1421433773&sr=8-4&keywords=Linear+Algebra+and+Its+Applications) - Gilbert Strang 1995 ## Courses #### Computer Vision * [EENG 512 / CSCI 512 - Computer Vision](http://inside.mines.edu/~whoff/courses/EENG512/) - William Hoff (Colorado School of Mines) * [Visual Object and Activity Recognition](https://sites.google.com/site/ucbcs29443/) - Alexei A. Efros and Trevor Darrell (UC Berkeley) * [Computer Vision](http://courses.cs.washington.edu/courses/cse455/12wi/) - Steve Seitz (University of Washington) * Visual Recognition [Spring 2016](http://vision.cs.utexas.edu/381V-spring2016/), [Fall 2016](http://vision.cs.utexas.edu/381V-fall2016/) - Kristen Grauman (UT Austin) * [Language and Vision](http://www.tamaraberg.com/teaching/Spring_15/) - Tamara Berg (UNC Chapel Hill) * [Convolutional Neural Networks for Visual Recognition](http://vision.stanford.edu/teaching/cs231n/) - Fei-Fei Li and Andrej Karpathy (Stanford University) * [Computer Vision](http://cs.nyu.edu/~fergus/teaching/vision/index.html) - Rob Fergus (NYU) * [Computer Vision](https://courses.engr.illinois.edu/cs543/sp2015/) - Derek Hoiem (UIUC) * [Computer Vision: Foundations and Applications](http://vision.stanford.edu/teaching/cs131_fall1415/index.html) - Kalanit Grill-Spector and Fei-Fei Li (Stanford University) * [High-Level Vision: Behaviors, Neurons and Computational Models](http://vision.stanford.edu/teaching/cs431_spring1314/) - Fei-Fei Li (Stanford University) * [Advances in Computer Vision](http://6.869.csail.mit.edu/fa15/) - Antonio Torralba and Bill Freeman (MIT) * [Computer Vision](http://www.vision.rwth-aachen.de/course/11/) - Bastian Leibe (RWTH Aachen University) * [Computer Vision 2](http://www.vision.rwth-aachen.de/course/9/) - Bastian Leibe (RWTH Aachen University) * [Computer Vision](http://klewel.com/conferences/epfl-computer-vision/) Pascal Fua (EPFL): * [Computer Vision 1](http://cvlab-dresden.de/courses/computer-vision-1/) Carsten Rother (TU Dresden): * [Computer Vision 2](http://cvlab-dresden.de/courses/CV2/) Carsten Rother (TU Dresden): * [Multiple View Geometry](https://youtu.be/RDkwklFGMfo?list=PLTBdjV_4f-EJn6udZ34tht9EVIW7lbeo4) Daniel Cremers (TU Munich): #### Computational Photography * [Image Manipulation and Computational Photography](http://inst.eecs.berkeley.edu/~cs194-26/fa14/) - Alexei A. Efros (UC Berkeley) * [Computational Photography](http://graphics.cs.cmu.edu/courses/15-463/2012_fall/463.html) - Alexei A. Efros (CMU) * [Computational Photography](https://courses.engr.illinois.edu/cs498dh3/) - Derek Hoiem (UIUC) * [Computational Photography](http://cs.brown.edu/courses/csci1290/) - James Hays (Brown University) * [Digital & Computational Photography](http://stellar.mit.edu/S/course/6/sp12/6.815/) - Fredo Durand (MIT) * [Computational Camera and Photography](http://ocw.mit.edu/courses/media-arts-and-sciences/mas-531-computational-camera-and-photography-fall-2009/) - Ramesh Raskar (MIT Media Lab) * [Computational Photography](https://www.udacity.com/course/computational-photography--ud955) - Irfan Essa (Georgia Tech) * [Courses in Graphics](http://graphics.stanford.edu/courses/) - Stanford University * [Computational Photography](http://cs.nyu.edu/~fergus/teaching/comp_photo/index.html) - Rob Fergus (NYU) * [Introduction to Visual Computing](http://www.cs.toronto.edu/~kyros/courses/320/) - Kyros Kutulakos (University of Toronto) * [Computational Photography](http://www.cs.toronto.edu/~kyros/courses/2530/) - Kyros Kutulakos (University of Toronto) * [Computer Vision for Visual Effects](https://www.ecse.rpi.edu/~rjradke/cvfxcourse.html) - Rich Radke (Rensselaer Polytechnic Institute) * [Introduction to Image Processing](https://www.ecse.rpi.edu/~rjradke/improccourse.html) - Rich Radke (Rensselaer Polytechnic Institute) #### Machine Learning and Statistical Learning * [Machine Learning](https://www.coursera.org/learn/machine-learning) - Andrew Ng (Stanford University) * [Learning from Data](https://work.caltech.edu/telecourse.html) - Yaser S. Abu-Mostafa (Caltech) * [Statistical Learning](https://class.stanford.edu/courses/HumanitiesandScience/StatLearning/Winter2015/about) - Trevor Hastie and Rob Tibshirani (Stanford University) * [Statistical Learning Theory and Applications](http://www.mit.edu/~9.520/fall14/) - Tomaso Poggio, Lorenzo Rosasco, Carlo Ciliberto, Charlie Frogner, Georgios Evangelopoulos, Ben Deen (MIT) * [Statistical Learning](http://www.stat.rice.edu/~gallen/stat640.html) - Genevera Allen (Rice University) * [Practical Machine Learning](http://www.cs.berkeley.edu/~jordan/courses/294-fall09/) - Michael Jordan (UC Berkeley) * [Course on Information Theory, Pattern Recognition, and Neural Networks](http://videolectures.net/course_information_theory_pattern_recognition/) - David MacKay (University of Cambridge) * [Methods for Applied Statistics: Unsupervised Learning](http://web.stanford.edu/~lmackey/stats306b/) - Lester Mackey (Stanford) * [Machine Learning](http://www.robots.ox.ac.uk/~az/lectures/ml/index.html) - Andrew Zisserman (University of Oxford) * [Intro to Machine Learning](https://www.udacity.com/course/intro-to-machine-learning--ud120) - Sebastian Thrun (Stanford University) * [Machine Learning](https://www.udacity.com/course/machine-learning--ud262) - Charles Isbell, Michael Littman (Georgia Tech) * [(Convolutional) Neural Networks for Visual Recognition](https://cs231n.github.io/) - Fei-Fei Li, Andrej Karphaty, Justin Johnson (Stanford University) * [Machine Learning for Computer Vision](https://youtu.be/QZmZFeZxEKI?list=PLTBdjV_4f-EIiongKlS9OKrBEp8QR47Wl) - Rudolph Triebel (TU Munich) #### Optimization * [Convex Optimization I](http://stanford.edu/class/ee364a/) - Stephen Boyd (Stanford University) * [Convex Optimization II](http://stanford.edu/class/ee364b/) - Stephen Boyd (Stanford University) * [Convex Optimization](https://class.stanford.edu/courses/Engineering/CVX101/Winter2014/about) - Stephen Boyd (Stanford University) * [Optimization at MIT](http://optimization.mit.edu/classes.php) - (MIT) * [Convex Optimization](http://www.stat.cmu.edu/~ryantibs/convexopt/) - Ryan Tibshirani (CMU) ## Papers #### Conference papers on the web * [CVPapers](http://www.cvpapers.com/) - Computer vision papers on the web * [SIGGRAPH Paper on the web](http://kesen.realtimerendering.com/) - Graphics papers on the web * [NIPS Proceedings](http://papers.nips.cc/) - NIPS papers on the web * [Computer Vision Foundation open access](http://www.cv-foundation.org/openaccess/menu.py) * [Annotated Computer Vision Bibliography](http://iris.usc.edu/Vision-Notes/bibliography/contents.html) - Keith Price (USC) * [Calendar of Computer Image Analysis, Computer Vision Conferences](http://iris.usc.edu/Information/Iris-Conferences.html) - (USC) #### Survey Papers * [Visionbib Survey Paper List](http://surveys.visionbib.com/index.html) * [Foundations and Trends® in Computer Graphics and Vision](http://www.nowpublishers.com/CGV) * [Computer Vision: A Reference Guide](http://link.springer.com/book/10.1007/978-0-387-31439-6) ## Tutorials and talks #### Computer Vision * [Computer Vision Talks](http://www.computervisiontalks.com/) - Lectures, keynotes, panel discussions on computer vision * [The Three R's of Computer Vision](https://www.youtube.com/watch?v=Mqg6eorYRIQ) - Jitendra Malik (UC Berkeley) 2013 * [Applications to Machine Vision](http://videolectures.net/epsrcws08_blake_amv/) - Andrew Blake (Microsoft Research) 2008 * [The Future of Image Search](http://videolectures.net/kdd08_malik_fis/?q=image) - Jitendra Malik (UC Berkeley) 2008 * [Should I do a PhD in Computer Vision?](https://www.youtube.com/watch?v=M17oGxh3Ny8) - Fatih Porikli (Australian National University) - [Graduate Summer School 2013: Computer Vision](http://www.ipam.ucla.edu/programs/summer-schools/graduate-summer-school-computer-vision/?tab=schedule) - IPAM, 2013 #### Recent Conference Talks - [CVPR 2015](http://www.pamitc.org/cvpr15/) - Jun 2015 - [ECCV 2014](http://videolectures.net/eccv2014_zurich/) - Sep 2014 - [CVPR 2014](http://techtalks.tv/cvpr-2014-oral-talks/) - Jun 2014 - [ICCV 2013](http://techtalks.tv/iccv2013/) - Dec 2013 - [ICML 2013](http://techtalks.tv/icml/2013/) - Jul 2013 - [CVPR 2013](http://techtalks.tv/cvpr2013/) - Jun 2013 - [ECCV 2012](http://videolectures.net/eccv2012_firenze/) - Oct 2012 - [ICML 2012](http://techtalks.tv/icml/2012/orals/) - Jun 2012 - [CVPR 2012](http://techtalks.tv/cvpr2012webcast/) - Jun 2012 #### 3D Computer Vision * [3D Computer Vision: Past, Present, and Future](https://www.youtube.com/watch?v=kyIzMr917Rc) - Steve Seitz (University of Washington) 2011 * [Reconstructing the World from Photos on the Internet](https://www.youtube.com/watch?v=04Kgg3QEXFI) - Steve Seitz (University of Washington) 2013 #### Internet Vision * [The Distributed Camera](http://www.technologyreview.com/video/426265/meet-2011-tr35-winner-noah-snavely/) - Noah Snavely (Cornell University) 2011 * [Planet-Scale Visual Understanding](https://www.youtube.com/watch?v=UHkCa9-Z1Ps) - Noah Snavely (Cornell University) 2014 * [A Trillion Photos](https://www.youtube.com/watch?v=6MWEfpKUfRc) - Steve Seitz (University of Washington) 2013 #### Computational Photography * [Reflections on Image-Based Modeling and Rendering](https://www.youtube.com/watch?v=j90_0Ndk7XM) - Richard Szeliski (Microsoft Research) 2013 * [Photographing Events over Time](https://www.youtube.com/watch?v=ZvPaHZZVPRk) - William T. Freeman (MIT) 2011 * [Old and New algorithm for Blind Deconvolution](http://videolectures.net/nipsworkshops2011_weiss_deconvolution/) - Yair Weiss (The Hebrew University of Jerusalem) 2011 * [A Tour of Modern "Image Processing"](http://videolectures.net/nipsworkshops2010_milanfar_tmi/) - Peyman Milanfar (UC Santa Cruz/Google) 2010 * [Topics in image and video processing](http://videolectures.net/mlss07_blake_tiivp/) Andrew Blake (Microsoft Research) 2007 * [Computational Photography](https://www.youtube.com/watch?v=HJVNI0mkmqk) - William T. Freeman (MIT) 2012 * [Revealing the Invisible](https://www.youtube.com/watch?v=_BWnIQY_X98) - Frédo Durand (MIT) 2012 * [Overview of Computer Vision and Visual Effects](https://www.youtube.com/watch?v=rE-hVtytT-I) - Rich Radke (Rensselaer Polytechnic Institute) 2014 #### Learning and Vision * [Where machine vision needs help from machine learning](http://videolectures.net/colt2011_freeman_help/?q=computer%20vision) - William T. Freeman (MIT) 2011 * [Learning in Computer Vision](http://videolectures.net/mlss08au_lucey_linv/) - Simon Lucey (CMU) 2008 * [Learning and Inference in Low-Level Vision](http://videolectures.net/nips09_weiss_lil/?q=computer%20vision) - Yair Weiss (The Hebrew University of Jerusalem) 2009 #### Object Recognition * [Object Recognition](http://research.microsoft.com/apps/video/dl.aspx?id=231358) - Larry Zitnick (Microsoft Research) * [Generative Models for Visual Objects and Object Recognition via Bayesian Inference](http://videolectures.net/mlas06_li_gmvoo/?q=Fei-Fei%20Li) - Fei-Fei Li (Stanford University) #### Graphical Models * [Graphical Models for Computer Vision](http://videolectures.net/uai2012_felzenszwalb_computer_vision/?q=computer%20vision) - Pedro Felzenszwalb (Brown University) 2012 * [Graphical Models](http://videolectures.net/mlss09uk_ghahramani_gm/) - Zoubin Ghahramani (University of Cambridge) 2009 * [Machine Learning, Probability and Graphical Models](http://videolectures.net/mlss06tw_roweis_mlpgm/) - Sam Roweis (NYU) 2006 * [Graphical Models and Applications](http://videolectures.net/mlss09us_weiss_gma/?q=Graphical%20Models) - Yair Weiss (The Hebrew University of Jerusalem) 2009 #### Machine Learning * [A Gentle Tutorial of the EM Algorithm](https://nikola-rt.ee.washington.edu/people/bulyko/papers/em.pdf) - Jeff A. Bilmes (UC Berkeley) 1998 * [Introduction To Bayesian Inference](http://videolectures.net/mlss09uk_bishop_ibi/) - Christopher Bishop (Microsoft Research) 2009 * [Support Vector Machines](http://videolectures.net/mlss06tw_lin_svm/) - Chih-Jen Lin (National Taiwan University) 2006 * [Bayesian or Frequentist, Which Are You? ](http://videolectures.net/mlss09uk_jordan_bfway/) - Michael I. Jordan (UC Berkeley) #### Optimization * [Optimization Algorithms in Machine Learning](http://videolectures.net/nips2010_wright_oaml/) - Stephen J. Wright (University of Wisconsin-Madison) * [Convex Optimization](http://videolectures.net/mlss07_vandenberghe_copt/?q=convex%20optimization) - Lieven Vandenberghe (University of California, Los Angeles) * [Continuous Optimization in Computer Vision](https://www.youtube.com/watch?v=oZqoWozVDVg) - Andrew Fitzgibbon (Microsoft Research) * [Beyond stochastic gradient descent for large-scale machine learning](http://videolectures.net/sahd2014_bach_stochastic_gradient/) - Francis Bach (INRIA) * [Variational Methods for Computer Vision](https://www.youtube.com/playlist?list=PLTBdjV_4f-EJ7A2iIH5L5ztqqrWYjP2RI) - Daniel Cremers (Technische Universität München) ([lecture 18 missing from playlist](https://www.youtube.com/watch?v=GgcbVPNd3SI)) #### Deep Learning * [A tutorial on Deep Learning](http://videolectures.net/jul09_hinton_deeplearn/) - Geoffrey E. Hinton (University of Toronto) * [Deep Learning](http://videolectures.net/kdd2014_salakhutdinov_deep_learning/?q=Hidden%20Markov%20model#) - Ruslan Salakhutdinov (University of Toronto) * [Scaling up Deep Learning](http://videolectures.net/kdd2014_bengio_deep_learning/) - Yoshua Bengio (University of Montreal) * [ImageNet Classification with Deep Convolutional Neural Networks](http://videolectures.net/machine_krizhevsky_imagenet_classification/?q=deep%20learning) - Alex Krizhevsky (University of Toronto) * [The Unreasonable Effectivness Of Deep Learning](http://videolectures.net/sahd2014_lecun_deep_learning/) Yann LeCun (NYU/Facebook Research) 2014 * [Deep Learning for Computer Vision](https://www.youtube.com/watch?v=qgx57X0fBdA) - Rob Fergus (NYU/Facebook Research) * [High-dimensional learning with deep network contractions](http://videolectures.net/sahd2014_mallat_dimensional_learning/) - Stéphane Mallat (Ecole Normale Superieure) * [Graduate Summer School 2012: Deep Learning, Feature Learning](http://www.ipam.ucla.edu/programs/summer-schools/graduate-summer-school-deep-learning-feature-learning/?tab=schedule) - IPAM, 2012 * [Workshop on Big Data and Statistical Machine Learning](http://www.fields.utoronto.ca/programs/scientific/14-15/bigdata/machine/) * [Machine Learning Summer School](https://www.youtube.com/channel/UC3ywjSv5OsDiDAnOP8C1NiQ) - Reykjavik, Iceland 2014 * [Deep Learning Session 1](https://www.youtube.com/watch?v=JuimBuvEWBg) - Yoshua Bengio (Universtiy of Montreal) * [Deep Learning Session 2](https://www.youtube.com/watch?v=Fl-W7_z3w3o) - Yoshua Bengio (University of Montreal) * [Deep Learning Session 3](https://www.youtube.com/watch?v=_cohR7LAgWA) - Yoshua Bengio (University of Montreal) ## Software #### Annotation tools * [Comma Coloring](http://commacoloring.herokuapp.com/) * [Annotorious](https://annotorious.github.io/) * [LabelME](http://labelme.csail.mit.edu/Release3.0/) * [gtmaker](https://github.com/sanko-shoko/gtmaker) #### External Resource Links * [Computer Vision Resources](https://sites.google.com/site/jbhuang0604/resources/vision) - Jia-Bin Huang (UIUC) * [Computer Vision Algorithm Implementations](http://www.cvpapers.com/rr.html) - CVPapers * [Source Code Collection for Reproducible Research](http://www.csee.wvu.edu/~xinl/reproducible_research.html) - Xin Li (West Virginia University) * [CMU Computer Vision Page](http://www.cs.cmu.edu/afs/cs/project/cil/ftp/html/v-source.html) #### General Purpose Computer Vision Library * [Open CV](http://opencv.org/) * [mexopencv](http://kyamagu.github.io/mexopencv/) * [SimpleCV](http://simplecv.org/) * [Open source Python module for computer vision](https://github.com/jesolem/PCV) * [ccv: A Modern Computer Vision Library](https://github.com/liuliu/ccv) * [VLFeat](http://www.vlfeat.org/) * [Matlab Computer Vision System Toolbox](http://www.mathworks.com/products/computer-vision/) * [Piotr's Computer Vision Matlab Toolbox](http://vision.ucsd.edu/~pdollar/toolbox/doc/index.html) * [PCL: Point Cloud Library](http://pointclouds.org/) * [ImageUtilities](https://gitorious.org/imageutilities) #### Multiple-view Computer Vision * [MATLAB Functions for Multiple View Geometry](http://www.robots.ox.ac.uk/~vgg/hzbook/code/) * [Peter Kovesi's Matlab Functions for Computer Vision and Image Analysis](http://staffhome.ecm.uwa.edu.au/~00011811/Research/MatlabFns/index.html) * [OpenGV ](http://laurentkneip.github.io/opengv/) - geometric computer vision algorithms * [MinimalSolvers](http://cmp.felk.cvut.cz/mini/) - Minimal problems solver * [Multi-View Environment](http://www.gcc.tu-darmstadt.de/home/proj/mve/) * [Visual SFM](http://ccwu.me/vsfm/) * [Bundler SFM](http://www.cs.cornell.edu/~snavely/bundler/) * [openMVG: open Multiple View Geometry](http://imagine.enpc.fr/~moulonp/openMVG/) - Multiple View Geometry; Structure from Motion library & softwares * [Patch-based Multi-view Stereo V2](http://www.di.ens.fr/pmvs/) * [Clustering Views for Multi-view Stereo](http://www.di.ens.fr/cmvs/) * [Floating Scale Surface Reconstruction](http://www.gris.informatik.tu-darmstadt.de/projects/floating-scale-surface-recon/) * [Large-Scale Texturing of 3D Reconstructions](http://www.gcc.tu-darmstadt.de/home/proj/texrecon/) * [Awesome 3D reconstruction list](https://github.com/openMVG/awesome_3DReconstruction_list) #### Feature Detection and Extraction * [VLFeat](http://www.vlfeat.org/) * [SIFT](http://www.cs.ubc.ca/~lowe/keypoints/) * David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 60, 2 (2004), pp. 91-110. * [SIFT++](http://www.robots.ox.ac.uk/~vedaldi/code/siftpp.html) * [BRISK](http://www.asl.ethz.ch/people/lestefan/personal/BRISK) * Stefan Leutenegger, Margarita Chli and Roland Siegwart, "BRISK: Binary Robust Invariant Scalable Keypoints", ICCV 2011 * [SURF](http://www.vision.ee.ethz.ch/~surf/) * Herbert Bay, Andreas Ess, Tinne Tuytelaars, Luc Van Gool, "SURF: Speeded Up Robust Features", Computer Vision and Image Understanding (CVIU), Vol. 110, No. 3, pp. 346--359, 2008 * [FREAK](http://www.ivpe.com/freak.htm) * A. Alahi, R. Ortiz, and P. Vandergheynst, "FREAK: Fast Retina Keypoint", CVPR 2012 * [AKAZE](http://www.robesafe.com/personal/pablo.alcantarilla/kaze.html) * Pablo F. Alcantarilla, Adrien Bartoli and Andrew J. Davison, "KAZE Features", ECCV 2012 * [Local Binary Patterns](https://github.com/nourani/LBP) #### High Dynamic Range Imaging * [HDR_Toolbox](https://github.com/banterle/HDR_Toolbox) #### Semantic Segmentation * [List of Semantic Segmentation algorithms](http://www.it-caesar.com/list-of-contemporary-semantic-segmentation-datasets/) #### Low-level Vision ###### Stereo Vision * [Middlebury Stereo Vision](http://vision.middlebury.edu/stereo/) * [The KITTI Vision Benchmark Suite](http://www.cvlibs.net/datasets/kitti/eval_stereo_flow.php?benchmark=stero) * [LIBELAS: Library for Efficient Large-scale Stereo Matching](http://www.cvlibs.net/software/libelas/) * [Ground Truth Stixel Dataset](http://www.6d-vision.com/ground-truth-stixel-dataset) ###### Optical Flow * [Middlebury Optical Flow Evaluation](http://vision.middlebury.edu/flow/) * [MPI-Sintel Optical Flow Dataset and Evaluation](http://sintel.is.tue.mpg.de/) * [The KITTI Vision Benchmark Suite](http://www.cvlibs.net/datasets/kitti/eval_stereo_flow.php?benchmark=flow) * [HCI Challenge](http://hci.iwr.uni-heidelberg.de/Benchmarks/document/Challenging_Data_for_Stereo_and_Optical_Flow/) * [Coarse2Fine Optical Flow](http://people.csail.mit.edu/celiu/OpticalFlow/) - Ce Liu (MIT) * [Secrets of Optical Flow Estimation and Their Principles](http://cs.brown.edu/~dqsun/code/cvpr10_flow_code.zip) * [C++/MatLab Optical Flow by C. Liu (based on Brox et al. and Bruhn et al.)](http://people.csail.mit.edu/celiu/OpticalFlow/) * [Parallel Robust Optical Flow by Sánchez Pérez et al.](http://www.ctim.es/research_works/parallel_robust_optical_flow/) ###### Image Denoising BM3D, KSVD, ###### Super-resolution * [Multi-frame image super-resolution](http://www.robots.ox.ac.uk/~vgg/software/SR/) * Pickup, L. C. Machine Learning in Multi-frame Image Super-resolution, PhD thesis 2008 * [Markov Random Fields for Super-Resolution](http://people.csail.mit.edu/billf/project%20pages/sresCode/Markov%20Random%20Fields%20for%20Super-Resolution.html) * W. T Freeman and C. Liu. Markov Random Fields for Super-resolution and Texture Synthesis. In A. Blake, P. Kohli, and C. Rother, eds., Advances in Markov Random Fields for Vision and Image Processing, Chapter 10. MIT Press, 2011 * [Sparse regression and natural image prior](https://people.mpi-inf.mpg.de/~kkim/supres/supres.htm) * K. I. Kim and Y. Kwon, "Single-image super-resolution using sparse regression and natural image prior", IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 6, pp. 1127-1133, 2010. * [Single-Image Super Resolution via a Statistical Model](http://www.cs.technion.ac.il/~elad/Various/SingleImageSR_TIP14_Box.zip) * T. Peleg and M. Elad, A Statistical Prediction Model Based on Sparse Representations for Single Image Super-Resolution, IEEE Transactions on Image Processing, Vol. 23, No. 6, Pages 2569-2582, June 2014 * [Sparse Coding for Super-Resolution](http://www.cs.technion.ac.il/~elad/Various/Single_Image_SR.zip) * R. Zeyde, M. Elad, and M. Protter On Single Image Scale-Up using Sparse-Representations, Curves & Surfaces, Avignon-France, June 24-30, 2010 (appears also in Lecture-Notes-on-Computer-Science - LNCS). * [Patch-wise Sparse Recovery](http://www.ifp.illinois.edu/~jyang29/ScSR.htm) * Jianchao Yang, John Wright, Thomas Huang, and Yi Ma. Image super-resolution via sparse representation. IEEE Transactions on Image Processing (TIP), vol. 19, issue 11, 2010. * [Neighbor embedding](http://www.jdl.ac.cn/user/hchang/doc/code.rar) * H. Chang, D.Y. Yeung, Y. Xiong. Super-resolution through neighbor embedding. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol.1, pp.275-282, Washington, DC, USA, 27 June - 2 July 2004. * [Deformable Patches](https://sites.google.com/site/yuzhushome/single-image-super-resolution-using-deformable-patches) * Yu Zhu, Yanning Zhang and Alan Yuille, Single Image Super-resolution using Deformable Patches, CVPR 2014 * [SRCNN](http://mmlab.ie.cuhk.edu.hk/projects/SRCNN.html) * Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, Learning a Deep Convolutional Network for Image Super-Resolution, in ECCV 2014 * [A+: Adjusted Anchored Neighborhood Regression](http://www.vision.ee.ethz.ch/~timofter/ACCV2014_ID820_SUPPLEMENTARY/index.html) * R. Timofte, V. De Smet, and L. Van Gool. A+: Adjusted Anchored Neighborhood Regression for Fast Super-Resolution, ACCV 2014 * [Transformed Self-Exemplars](https://sites.google.com/site/jbhuang0604/publications/struct_sr) * Jia-Bin Huang, Abhishek Singh, and Narendra Ahuja, Single Image Super-Resolution using Transformed Self-Exemplars, IEEE Conference on Computer Vision and Pattern Recognition, 2015 ###### Image Deblurring Non-blind deconvolution * [Spatially variant non-blind deconvolution](http://homes.cs.washington.edu/~shanqi/work/spvdeconv/) * [Handling Outliers in Non-blind Image Deconvolution](http://cg.postech.ac.kr/research/deconv_outliers/) * [Hyper-Laplacian Priors](http://cs.nyu.edu/~dilip/research/fast-deconvolution/) * [From Learning Models of Natural Image Patches to Whole Image Restoration](http://people.csail.mit.edu/danielzoran/epllcode.zip) * [Deep Convolutional Neural Network for Image Deconvolution](http://lxu.me/projects/dcnn/) * [Neural Deconvolution](http://webdav.is.mpg.de/pixel/neural_deconvolution/) Blind deconvolution * [Removing Camera Shake From A Single Photograph](http://www.cs.nyu.edu/~fergus/research/deblur.html) * [High-quality motion deblurring from a single image](http://www.cse.cuhk.edu.hk/leojia/projects/motion_deblurring/) * [Two-Phase Kernel Estimation for Robust Motion Deblurring](http://www.cse.cuhk.edu.hk/leojia/projects/robust_deblur/) * [Blur kernel estimation using the radon transform](http://people.csail.mit.edu/taegsang/Documents/RadonDeblurringCode.zip) * [Fast motion deblurring](http://cg.postech.ac.kr/research/fast_motion_deblurring/) * [Blind Deconvolution Using a Normalized Sparsity Measure](http://cs.nyu.edu//~dilip/research/blind-deconvolution/) * [Blur-kernel estimation from spectral irregularities](http://www.cs.huji.ac.il/~raananf/projects/deblur/) * [Efficient marginal likelihood optimization in blind deconvolution](http://www.wisdom.weizmann.ac.il/~levina/papers/LevinEtalCVPR2011Code.zip) * [Unnatural L0 Sparse Representation for Natural Image Deblurring](http://www.cse.cuhk.edu.hk/leojia/projects/l0deblur/) * [Edge-based Blur Kernel Estimation Using Patch Priors](http://cs.brown.edu/~lbsun/deblur2013/deblur2013iccp.html) * [Blind Deblurring Using Internal Patch Recurrence](http://www.wisdom.weizmann.ac.il/~vision/BlindDeblur.html) Non-uniform Deblurring * [Non-uniform Deblurring for Shaken Images](http://www.di.ens.fr/willow/research/deblurring/) * [Single Image Deblurring Using Motion Density Functions](http://grail.cs.washington.edu/projects/mdf_deblurring/) * [Image Deblurring using Inertial Measurement Sensors](http://research.microsoft.com/en-us/um/redmond/groups/ivm/imudeblurring/) * [Fast Removal of Non-uniform Camera Shake](http://webdav.is.mpg.de/pixel/fast_removal_of_camera_shake/) ###### Image Completion * [GIMP Resynthesizer](http://registry.gimp.org/node/27986) * [Priority BP](http://lafarren.com/image-completer/) * [ImageMelding](http://www.ece.ucsb.edu/~psen/melding) * [PlanarStructureCompletion](https://sites.google.com/site/jbhuang0604/publications/struct_completion) ###### Image Retargeting * [RetargetMe](http://people.csail.mit.edu/mrub/retargetme/) ###### Alpha Matting * [Alpha Matting Evaluation](http://www.alphamatting.com/) * [Closed-form image matting](http://people.csail.mit.edu/alevin/matting.tar.gz) * [Spectral Matting](http://www.vision.huji.ac.il/SpectralMatting/) * [Learning-based Matting](http://www.mathworks.com/matlabcentral/fileexchange/31412-learning-based-digital-matting) * [Improving Image Matting using Comprehensive Sampling Sets](http://www.alphamatting.com/ImprovingMattingComprehensiveSamplingSets_CVPR2013.zip) ###### Image Pyramid * [The Steerable Pyramid](http://www.cns.nyu.edu/~eero/steerpyr/) * [CurveLab](http://www.curvelet.org/) ###### Edge-preserving image processing * [Fast Bilateral Filter](http://people.csail.mit.edu/sparis/bf/) * [O(1) Bilateral Filter](http://www.cs.cityu.edu.hk/~qiyang/publications/code/qx.cvpr09.ctbf.zip) * [Recursive Bilateral Filtering](http://www.cs.cityu.edu.hk/~qiyang/publications/eccv-12/) * [Rolling Guidance Filter](http://www.cse.cuhk.edu.hk/leojia/projects/rollguidance/) * [Relative Total Variation](http://www.cse.cuhk.edu.hk/leojia/projects/texturesep/index.html) * [L0 Gradient Optimization](http://www.cse.cuhk.edu.hk/leojia/projects/L0smoothing/index.html) * [Domain Transform](http://www.inf.ufrgs.br/~eslgastal/DomainTransform/) * [Adaptive Manifold](http://inf.ufrgs.br/~eslgastal/AdaptiveManifolds/) * [Guided image filtering](http://research.microsoft.com/en-us/um/people/kahe/eccv10/) #### Intrinsic Images * [Recovering Intrinsic Images with a global Sparsity Prior on Reflectance](http://people.tuebingen.mpg.de/mkiefel/projects/intrinsic/) * [Intrinsic Images by Clustering](http://giga.cps.unizar.es/~elenag/projects/EGSR2012_intrinsic/) #### Contour Detection and Image Segmentation * [Mean Shift Segmentation](http://coewww.rutgers.edu/riul/research/code/EDISON/) * [Graph-based Segmentation](http://cs.brown.edu/~pff/segment/) * [Normalized Cut](http://www.cis.upenn.edu/~jshi/software/) * [Grab Cut](http://grabcut.weebly.com/background--algorithm.html) * [Contour Detection and Image Segmentation](http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html) * [Structured Edge Detection](http://research.microsoft.com/en-us/downloads/389109f6-b4e8-404c-84bf-239f7cbf4e3d/) * [Pointwise Mutual Information](http://web.mit.edu/phillipi/pmi-boundaries/) * [SLIC Super-pixel](http://ivrl.epfl.ch/research/superpixels) * [QuickShift](http://www.vlfeat.org/overview/quickshift.html) * [TurboPixels](http://www.cs.toronto.edu/~babalex/research.html) * [Entropy Rate Superpixel](http://mingyuliu.net/) * [Contour Relaxed Superpixels](http://www.vsi.cs.uni-frankfurt.de/research/current-projects/research/superpixel-segmentation/) * [SEEDS](http://www.mvdblive.org/seeds/) * [SEEDS Revised](https://github.com/davidstutz/seeds-revised) * [Multiscale Combinatorial Grouping](http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/mcg/) * [Fast Edge Detection Using Structured Forests](https://github.com/pdollar/edges) #### Interactive Image Segmentation * [Random Walker](http://cns.bu.edu/~lgrady/software.html) * [Geodesic Segmentation](http://www.tc.umn.edu/~baixx015/) * [Lazy Snapping](http://research.microsoft.com/apps/pubs/default.aspx?id=69040) * [Power Watershed](http://powerwatershed.sourceforge.net/) * [Geodesic Graph Cut](http://www.adobe.com/technology/people/san-jose/brian-price.html) * [Segmentation by Transduction](http://www.cs.cmu.edu/~olivierd/) #### Video Segmentation * [Video Segmentation with Superpixels](http://www.mpi-inf.mpg.de/departments/computer-vision-and-multimodal-computing/research/image-and-video-segmentation/video-segmentation-with-superpixels/) * [Efficient hierarchical graph-based video segmentation](http://www.cc.gatech.edu/cpl/projects/videosegmentation/) * [Object segmentation in video](http://lmb.informatik.uni-freiburg.de/Publications/2011/OB11/) * [Streaming hierarchical video segmentation](http://www.cse.buffalo.edu/~jcorso/r/supervoxels/) #### Camera calibration * [Camera Calibration Toolbox for Matlab](http://www.vision.caltech.edu/bouguetj/calib_doc/) * [Camera calibration With OpenCV](http://docs.opencv.org/trunk/doc/tutorials/calib3d/camera_calibration/camera_calibration.html#) * [Multiple Camera Calibration Toolbox](https://sites.google.com/site/prclibo/toolbox) #### Simultaneous localization and mapping ###### SLAM community: * [openSLAM](https://www.openslam.org/) * [Kitti Odometry: benchmark for outdoor visual odometry (codes may be available)](http://www.cvlibs.net/datasets/kitti/eval_odometry.php) ###### Tracking/Odometry: * [LIBVISO2: C++ Library for Visual Odometry 2](http://www.cvlibs.net/software/libviso/) * [PTAM: Parallel tracking and mapping](http://www.robots.ox.ac.uk/~gk/PTAM/) * [KFusion: Implementation of KinectFusion](https://github.com/GerhardR/kfusion) * [kinfu_remake: Lightweight, reworked and optimized version of Kinfu.](https://github.com/Nerei/kinfu_remake) * [LVR-KinFu: kinfu_remake based Large Scale KinectFusion with online reconstruction](http://las-vegas.uni-osnabrueck.de/related-projects/lvr-kinfu/) * [InfiniTAM: Implementation of multi-platform large-scale depth tracking and fusion](http://www.robots.ox.ac.uk/~victor/infinitam/) * [VoxelHashing: Large-scale KinectFusion](https://github.com/nachtmar/VoxelHashing) * [SLAMBench: Multiple-implementation of KinectFusion](http://apt.cs.manchester.ac.uk/projects/PAMELA/tools/SLAMBench/) * [SVO: Semi-direct visual odometry](https://github.com/uzh-rpg/rpg_svo) * [DVO: dense visual odometry](https://github.com/tum-vision/dvo_slam) * [FOVIS: RGB-D visual odometry](https://code.google.com/p/fovis/) ###### Graph Optimization: * [GTSAM: General smoothing and mapping library for Robotics and SFM](https://collab.cc.gatech.edu/borg/gtsam?destination=node%2F299) -- Georgia Institute of Technology * [G2O: General framework for graph optomization](https://github.com/RainerKuemmerle/g2o) ###### Loop Closure: * [FabMap: appearance-based loop closure system](http://www.robots.ox.ac.uk/~mjc/Software.htm) - also available in [OpenCV2.4.11](http://docs.opencv.org/2.4/modules/contrib/doc/openfabmap.html) * [DBoW2: binary bag-of-words loop detection system](http://webdiis.unizar.es/~dorian/index.php?p=32) ###### Localization & Mapping: * [RatSLAM](https://code.google.com/p/ratslam/) * [LSD-SLAM](https://github.com/tum-vision/lsd_slam) * [ORB-SLAM](https://github.com/raulmur/ORB_SLAM) #### Single-view Spatial Understanding * [Geometric Context](http://web.engr.illinois.edu/~dhoiem/projects/software.html) - Derek Hoiem (CMU) * [Recovering Spatial Layout](http://web.engr.illinois.edu/~dhoiem/software/counter.php?Down=varsha_spatialLayout.zip) - Varsha Hedau (UIUC) * [Geometric Reasoning](http://www.cs.cmu.edu/~./dclee/code/index.html) - David C. Lee (CMU) * [RGBD2Full3D](https://github.com/arron2003/rgbd2full3d) - Ruiqi Guo (UIUC) #### Object Detection * [INRIA Object Detection and Localization Toolkit](http://pascal.inrialpes.fr/soft/olt/) * [Discriminatively trained deformable part models](http://www.cs.berkeley.edu/~rbg/latent/) * [VOC-DPM](https://github.com/rbgirshick/voc-dpm) * [Histograms of Sparse Codes for Object Detection](http://www.ics.uci.edu/~dramanan/software/sparse/) * [R-CNN: Regions with Convolutional Neural Network Features](https://github.com/rbgirshick/rcnn) * [SPP-Net](https://github.com/ShaoqingRen/SPP_net) * [BING: Objectness Estimation](http://mmcheng.net/bing/comment-page-9/) * [Edge Boxes](https://github.com/pdollar/edges) * [ReInspect](https://github.com/Russell91/ReInspect) #### Nearest Neighbor Search ###### General purpose nearest neighbor search * [ANN: A Library for Approximate Nearest Neighbor Searching](http://www.cs.umd.edu/~mount/ANN/) * [FLANN - Fast Library for Approximate Nearest Neighbors](http://www.cs.ubc.ca/research/flann/) * [Fast k nearest neighbor search using GPU](http://vincentfpgarcia.github.io/kNN-CUDA/) ###### Nearest Neighbor Field Estimation * [PatchMatch](http://gfx.cs.princeton.edu/gfx/pubs/Barnes_2009_PAR/index.php) * [Generalized PatchMatch](http://gfx.cs.princeton.edu/pubs/Barnes_2010_TGP/index.php) * [Coherency Sensitive Hashing](http://www.eng.tau.ac.il/~simonk/CSH/) * [PMBP: PatchMatch Belief Propagation](https://github.com/fbesse/pmbp) * [TreeCANN](http://www.eng.tau.ac.il/~avidan/papers/TreeCANN_code_20121022.rar) #### Visual Tracking * [Visual Tracker Benchmark](https://sites.google.com/site/trackerbenchmark/benchmarks/v10) * [Visual Tracking Challenge](http://www.votchallenge.net/) * [Kanade-Lucas-Tomasi Feature Tracker](http://www.ces.clemson.edu/~stb/klt/) * [Extended Lucas-Kanade Tracking](http://www.eng.tau.ac.il/~oron/ELK/ELK.html) * [Online-boosting Tracking](http://www.vision.ee.ethz.ch/boostingTrackers/) * [Spatio-Temporal Context Learning](http://www4.comp.polyu.edu.hk/~cslzhang/STC/STC.htm) * [Locality Sensitive Histograms](http://www.shengfenghe.com/visual-tracking-via-locality-sensitive-histograms.html) * [Enhanced adaptive coupled-layer LGTracker++](http://www.cv-foundation.org/openaccess/content_iccv_workshops_2013/W03/papers/Xiao_An_Enhanced_Adaptive_2013_ICCV_paper.pdf) * [TLD: Tracking - Learning - Detection](http://personal.ee.surrey.ac.uk/Personal/Z.Kalal/tld.html) * [CMT: Clustering of Static-Adaptive Correspondences for Deformable Object Tracking](http://www.gnebehay.com/cmt/) * [Kernelized Correlation Filters](http://home.isr.uc.pt/~henriques/circulant/) * [Accurate Scale Estimation for Robust Visual Tracking](http://www.cvl.isy.liu.se/en/research/objrec/visualtracking/scalvistrack/index.html) * [Multiple Experts using Entropy Minimization](http://cs-people.bu.edu/jmzhang/MEEM/MEEM.html) * [TGPR](http://www.dabi.temple.edu/~hbling/code/TGPR.htm) * [CF2: Hierarchical Convolutional Features for Visual Tracking](https://sites.google.com/site/jbhuang0604/publications/cf2) * [Modular Tracking Framework](http://webdocs.cs.ualberta.ca/~vis/mtf/index.html) #### Saliency Detection #### Attributes #### Action Reconition #### Egocentric cameras #### Human-in-the-loop systems #### Image Captioning * [NeuralTalk](https://github.com/karpathy/neuraltalk) - #### Optimization * [Ceres Solver](http://ceres-solver.org/) - Nonlinear least-square problem and unconstrained optimization solver * [NLopt](http://ab-initio.mit.edu/wiki/index.php/NLopt)- Nonlinear least-square problem and unconstrained optimization solver * [OpenGM](http://hci.iwr.uni-heidelberg.de/opengm2/) - Factor graph based discrete optimization and inference solver * [GTSAM](https://collab.cc.gatech.edu/borg/gtsam/) - Factor graph based lease-square optimization solver #### Deep Learning * [Awesome Deep Vision](https://github.com/kjw0612/awesome-deep-vision) #### Machine Learning * [Awesome Machine Learning](https://github.com/josephmisiti/awesome-machine-learning) * [Bob: a free signal processing and machine learning toolbox for researchers](http://idiap.github.io/bob/) * [LIBSVM -- A Library for Support Vector Machines](https://www.csie.ntu.edu.tw/~cjlin/libsvm/) ## Datasets #### External Dataset Link Collection * [CV Datasets on the web](http://www.cvpapers.com/datasets.html) - CVPapers * [Are we there yet?](http://rodrigob.github.io/are_we_there_yet/build/) - Which paper provides the best results on standard dataset X? * [Computer Vision Dataset on the web](http://www.cvpapers.com/datasets.html) * [Yet Another Computer Vision Index To Datasets](http://riemenschneider.hayko.at/vision/dataset/) * [ComputerVisionOnline Datasets](http://www.computervisiononline.com/datasets) * [CVOnline Dataset](http://homepages.inf.ed.ac.uk/cgi/rbf/CVONLINE/entries.pl?TAG363) * [CV datasets](http://clickdamage.com/sourcecode/cv_datasets.php) * [visionbib](http://datasets.visionbib.com/info-index.html) * [VisualData](http://www.visualdata.io/) #### Low-level Vision ###### Stereo Vision * [Middlebury Stereo Vision](http://vision.middlebury.edu/stereo/) * [The KITTI Vision Benchmark Suite](http://www.cvlibs.net/datasets/kitti/eval_stereo_flow.php?benchmark=stero) * [LIBELAS: Library for Efficient Large-scale Stereo Matching](http://www.cvlibs.net/software/libelas/) * [Ground Truth Stixel Dataset](http://www.6d-vision.com/ground-truth-stixel-dataset) ###### Optical Flow * [Middlebury Optical Flow Evaluation](http://vision.middlebury.edu/flow/) * [MPI-Sintel Optical Flow Dataset and Evaluation](http://sintel.is.tue.mpg.de/) * [The KITTI Vision Benchmark Suite](http://www.cvlibs.net/datasets/kitti/eval_stereo_flow.php?benchmark=flow) * [HCI Challenge](http://hci.iwr.uni-heidelberg.de/Benchmarks/document/Challenging_Data_for_Stereo_and_Optical_Flow/) ###### Video Object Segmentation * [DAVIS: Densely Annotated VIdeo Segmentation](http://davischallenge.org/) * [SegTrack v2](http://web.engr.oregonstate.edu/~lif/SegTrack2/dataset.html) ###### Change Detection * [Labeled and Annotated Sequences for Integral Evaluation of SegmenTation Algorithms](http://www.gti.ssr.upm.es/data/LASIESTA) * [ChangeDetection.net](http://www.changedetection.net/) ###### Image Super-resolutions * [Single-Image Super-Resolution: A Benchmark](https://eng.ucmerced.edu/people/cyang35/ECCV14/ECCV14.html) #### Intrinsic Images * [Ground-truth dataset and baseline evaluations for intrinsic image algorithms](http://www.mit.edu/~kimo/publications/intrinsic/) * [Intrinsic Images in the Wild](http://opensurfaces.cs.cornell.edu/intrinsic/) * [Intrinsic Image Evaluation on Synthetic Complex Scenes](http://www.cic.uab.cat/Datasets/synthetic_intrinsic_image_dataset/) #### Material Recognition * [OpenSurface](http://opensurfaces.cs.cornell.edu/) * [Flickr Material Database](http://people.csail.mit.edu/celiu/CVPR2010/) * [Materials in Context Dataset](http://opensurfaces.cs.cornell.edu/publications/minc/) #### Multi-view Reconsturction * [Multi-View Stereo Reconstruction](http://vision.middlebury.edu/mview/) #### Saliency Detection #### Visual Tracking * [Visual Tracker Benchmark](https://sites.google.com/site/trackerbenchmark/benchmarks/v10) * [Visual Tracker Benchmark v1.1](https://sites.google.com/site/benchmarkpami/) * [VOT Challenge](http://www.votchallenge.net/) * [Princeton Tracking Benchmark](http://tracking.cs.princeton.edu/) * [Tracking Manipulation Tasks (TMT)](http://webdocs.cs.ualberta.ca/~vis/trackDB/) #### Visual Surveillance * [VIRAT](http://www.viratdata.org/) * [CAM2](https://cam2.ecn.purdue.edu/) #### Saliency Detection #### Change detection * [ChangeDetection.net](http://changedetection.net/) #### Visual Recognition ###### Image Classification * [The PASCAL Visual Object Classes](http://pascallin.ecs.soton.ac.uk/challenges/VOC/) * [ImageNet Large Scale Visual Recognition Challenge](http://www.image-net.org/challenges/LSVRC/2014/) ###### Scene Recognition * [SUN Database](http://groups.csail.mit.edu/vision/SUN/) * [Place Dataset](http://places.csail.mit.edu/) ###### Object Detection * [The PASCAL Visual Object Classes](http://pascallin.ecs.soton.ac.uk/challenges/VOC/) * [ImageNet Object Detection Challenge](http://www.image-net.org/challenges/LSVRC/2014/) * [Microsoft COCO](http://mscoco.org/) ###### Semantic labeling * [Stanford background dataset](http://dags.stanford.edu/projects/scenedataset.html) * [CamVid](http://mi.eng.cam.ac.uk/research/projects/VideoRec/CamVid/) * [Barcelona Dataset](http://www.cs.unc.edu/~jtighe/Papers/ECCV10/) * [SIFT Flow Dataset](http://www.cs.unc.edu/~jtighe/Papers/ECCV10/siftflow/SiftFlowDataset.zip) ###### Multi-view Object Detection * [3D Object Dataset](http://cvgl.stanford.edu/resources.html) * [EPFL Car Dataset](http://cvlab.epfl.ch/data/pose) * [KTTI Dection Dataset](http://www.cvlibs.net/datasets/kitti/eval_object.php) * [SUN 3D Dataset](http://sun3d.cs.princeton.edu/) * [PASCAL 3D+](http://cvgl.stanford.edu/projects/pascal3d.html) * [NYU Car Dataset](http://nyc3d.cs.cornell.edu/) ###### Fine-grained Visual Recognition * [Fine-grained Classification Challenge](https://sites.google.com/site/fgcomp2013/) * [Caltech-UCSD Birds 200](http://www.vision.caltech.edu/visipedia/CUB-200.html) ###### Pedestrian Detection * [Caltech Pedestrian Detection Benchmark](http://www.vision.caltech.edu/Image_Datasets/CaltechPedestrians/) * [ETHZ Pedestrian Detection](https://data.vision.ee.ethz.ch/cvl/aess/dataset/) #### Action Recognition ###### Image-based ###### Video-based * [HOLLYWOOD2 Dataset](http://www.di.ens.fr/~laptev/actions/hollywood2/) * [UCF Sports Action Data Set](http://crcv.ucf.edu/data/UCF_Sports_Action.php) ###### Image Deblurring * [Sun dataset](http://cs.brown.edu/~lbsun/deblur2013/deblur2013iccp.html) * [Levin dataset](http://www.wisdom.weizmann.ac.il/~levina/papers/LevinEtalCVPR09Data.rar) #### Image Captioning * [Flickr 8K](http://nlp.cs.illinois.edu/HockenmaierGroup/Framing_Image_Description/KCCA.html) * [Flickr 30K](http://shannon.cs.illinois.edu/DenotationGraph/) * [Microsoft COCO](http://mscoco.org/) #### Scene Understanding # [SUN RGB-D](http://rgbd.cs.princeton.edu/) - A RGB-D Scene Understanding Benchmark Suite # [NYU depth v2](http://cs.nyu.edu/~silberman/datasets/nyu_depth_v2.html) - Indoor Segmentation and Support Inference from RGBD Images #### Aerial images # [Aerial Image Segmentation](https://zenodo.org/record/1154821#.WmN9kHWnHIp) - Learning Aerial Image Segmentation From Online Maps ## Resources for students #### Resource link collection * [Resources for students](http://people.csail.mit.edu/fredo/student.html) - Frédo Durand (MIT) * [Advice for Graduate Students](http://www.dgp.toronto.edu/~hertzman/advice/) - Aaron Hertzmann (Adobe Research) * [Graduate Skills Seminars](http://www.dgp.toronto.edu/~hertzman/courses/gradSkills/2010/) - Yashar Ganjali, Aaron Hertzmann (University of Toronto) * [Research Skills](http://research.microsoft.com/en-us/um/people/simonpj/papers/giving-a-talk/giving-a-talk.htm) - Simon Peyton Jones (Microsoft Research) * [Resource collection](http://web.engr.illinois.edu/~taoxie/advice.htm) - Tao Xie (UIUC) and Yuan Xie (UCSB) #### Writing * [Write Good Papers](http://people.csail.mit.edu/fredo/FredoGoodWriting.pdf) - Frédo Durand (MIT) * [Notes on writing](http://people.csail.mit.edu/fredo/PUBLI/writing.pdf) - Frédo Durand (MIT) * [How to Write a Bad Article](http://people.csail.mit.edu/fredo/FredoBadWriting.pdf) - Frédo Durand (MIT) * [How to write a good CVPR submission](http://billf.mit.edu/sites/default/files/documents/cvprPapers.pdf) - William T. Freeman (MIT) * [How to write a great research paper](https://www.youtube.com/watch?v=g3dkRsTqdDA) - Simon Peyton Jones (Microsoft Research) * [How to write a SIGGRAPH paper](http://www.slideshare.net/jdily/how-to-write-a-siggraph-paper) - SIGGRAPH ASIA 2011 Course * [Writing Research Papers](http://www.dgp.toronto.edu/~hertzman/advice/writing-technical-papers.pdf) - Aaron Hertzmann (Adobe Research) * [How to Write a Paper for SIGGRAPH](http://www.computer.org/csdl/mags/cg/1987/12/mcg1987120062.pdf) - Jim Blinn * [How to Get Your SIGGRAPH Paper Rejected](http://www.siggraph.org/sites/default/files/kajiya.pdf) - Jim Kajiya (Microsoft Research) * [How to write a SIGGRAPH paper](www.liyiwei.org/courses/how-siga11/liyiwei.pptx) - Li-Yi Wei (The University of Hong Kong) * [How to Write a Great Paper](http://www-hagen.informatik.uni-kl.de/~bertram/talks/getpublished.pdf) - Martin Martin Hering Hering--Bertram (Hochschule Bremen University of Applied Sciences) * [How to have a paper get into SIGGRAPH?](http://www-ui.is.s.u-tokyo.ac.jp/~takeo/writings/siggraph.html) - Takeo Igarashi (The University of Tokyo) * [Good Writing](http://www.cs.cmu.edu/~pausch/Randy/Randy/raibert.htm) - Marc H. Raibert (Boston Dynamics, Inc.) * [How to Write a Computer Vision Paper](http://web.engr.illinois.edu/~dhoiem/presentations/How%20to%20Write%20a%20Computer%20Vison%20Paper.ppt) - Derek Hoiem (UIUC) * [Common mistakes in technical writing](http://www.cs.dartmouth.edu/~wjarosz/writing.html) - Wojciech Jarosz (Dartmouth College) #### Presentation * [Giving a Research Talk](http://people.csail.mit.edu/fredo/TalkAdvice.pdf) - Frédo Durand (MIT) * [How to give a good talk](http://www.dgp.toronto.edu/~hertzman/courses/gradSkills/2010/GivingGoodTalks.pdf) - David Fleet (University of Toronto) and Aaron Hertzmann (Adobe Research) * [Designing conference posters](http://colinpurrington.com/tips/poster-design) - Colin Purrington #### Research * [How to do research](http://people.csail.mit.edu/billf/www/papers/doresearch.pdf) - William T. Freeman (MIT) * [You and Your Research](http://www.cs.virginia.edu/~robins/YouAndYourResearch.html) - Richard Hamming * [Warning Signs of Bogus Progress in Research in an Age of Rich Computation and Information](http://yima.csl.illinois.edu/psfile/bogus.pdf) - Yi Ma (UIUC) * [Seven Warning Signs of Bogus Science](http://www.quackwatch.com/01QuackeryRelatedTopics/signs.html) - Robert L. Park * [Five Principles for Choosing Research Problems in Computer Graphics](https://www.youtube.com/watch?v=v2Qaf8t8I6c) - Thomas Funkhouser (Cornell University) * [How To Do Research In the MIT AI Lab](http://www.cs.indiana.edu/mit.research.how.to.html) - David Chapman (MIT) * [Recent Advances in Computer Vision](http://www.slideshare.net/antiw/recent-advances-in-computer-vision) - Ming-Hsuan Yang (UC Merced) * [How to Come Up with Research Ideas in Computer Vision?](http://www.slideshare.net/jbhuang/how-to-come-up-with-new-research-ideas-4005840) - Jia-Bin Huang (UIUC) * [How to Read Academic Papers](http://www.slideshare.net/jbhuang/how-to-read-academic-papers) - Jia-Bin Huang (UIUC) #### Time Management * [Time Management](https://www.youtube.com/watch?v=oTugjssqOT0) - Randy Pausch (CMU) ## Blogs * [Learn OpenCV](http://www.learnopencv.com/) - Satya Mallick * [Tombone's Computer Vision Blog](http://www.computervisionblog.com/) - Tomasz Malisiewicz * [Computer vision for dummies](http://www.visiondummy.com/) - Vincent Spruyt * [Andrej Karpathy blog](http://karpathy.github.io/) - Andrej Karpathy * [AI Shack](http://aishack.in/) - Utkarsh Sinha * [Computer Vision Talks](http://computer-vision-talks.com/) - Eugene Khvedchenya * [Computer Vision Basics with Python Keras and OpenCV](https://github.com/jrobchin/Computer-Vision-Basics-with-Python-Keras-and-OpenCV) - Jason Chin (University of Western Ontario) ## Links * [The Computer Vision Industry](http://www.cs.ubc.ca/~lowe/vision.html) - David Lowe * [German Computer Vision Research Groups & Companies](http://hci.iwr.uni-heidelberg.de/Links/German_Vision/) * [awesome-deep-learning](https://github.com/ChristosChristofidis/awesome-deep-learning) * [awesome-machine-learning](https://github.com/josephmisiti/awesome-machine-learning) * [Cat Paper Collection](http://www.eecs.berkeley.edu/~junyanz/cat/cat_papers.html) * [Computer Vision News](http://www.rsipvision.com/computer-vision-news/) * ## Songs * [The Fundamental Matrix Song](http://danielwedge.com/fmatrix/) * [The RANSAC Song](http://danielwedge.com/ransac/) * [Machine Learning A Cappella - Overfitting Thriller](https://www.youtube.com/watch?v=DQWI1kvmwRg) ## Licenses License [![CC0](http://i.creativecommons.org/p/zero/1.0/88x31.png)](http://creativecommons.org/publicdomain/zero/1.0/) To the extent possible under law, [Jia-Bin Huang](www.jiabinhuang.com) has waived all copyright and related or neighboring rights to this work.

简介

A curated list of awesome computer vision resources 展开 收起
CC0-1.0
取消

发行版

暂无发行版

贡献者

全部

近期动态

加载更多
不能加载更多了
马建仓 AI 助手
尝试更多
代码解读
代码找茬
代码优化
1
https://gitee.com/changjy/awesome-computer-vision.git
git@gitee.com:changjy/awesome-computer-vision.git
changjy
awesome-computer-vision
awesome-computer-vision
master

搜索帮助

D1d26d21 1850385 D1bef801 1850385