存档

2011年12月 的存档

[招聘] 华为 多媒体技术实验室杭州分部

2011年12月31日 5 条评论

Job Function:

Researcher responsible for developing algorithms and prototype of research projects related to video and audio. The candidate will work with the multimedia technology lab in Hangzhou or Shenzhen.

Skills/Experience

Must hold a Bachelors and/or a Masters/or a PhD in Electrical Engineering/or Computer Engineering with at least 2+ years experience in design, development, and integration of multimedia algorithms.

Strong Knowledge in computational camera, computational photography, 3D display and video/audio signal processing.

Knowledge in video standards on H.264, MPEG4, VP6/VP8, or graphics domain, Open GL standards is a definite plus.

Knowledge in audio standards on AMR, AMR-WB, G.711, G.719 and OpenAL.

Knowledge in optical lens design is a plus.

Strong C, C++, and MatLab development skills is required.

Experience in initiating a research proposal and conduct the research activities, developing deliverable software with a GUI interface is desired.

Should have good analytical ability, problem solving skills and be a self-starter.

Work well within a matrix organization and able to influence and collaborate with team members all over the world.

阅读全文…

分类: 新闻 标签: ,

vibe

2011年12月25日 22 条评论

– a powerful technique for background detection and subtraction in video sequences

Executive summary

Description

ViBe is a powerful pixel-based technique that detects the background in video sequences. Many experiments have shown that it performs better than the state-of-the-art techniques known in the scientific literature. In addition the computational load is lower than simple background techniques implemented in commercial products. ViBe is the perfect solution for both software and hardware implementations.

Code and program for Windows and Linux
  • A program for Windows and Linux. Download an archive zip archive [10 MB - updated on May 19, 2011] to use ViBe on Windows (or under Wine in Linux). Details on this page.
    The program allows you to: (1) save the result for your own images, (2) change the few parameters of ViBe to experiment with, and (3) reproduce our results.
  • Linux: link a C/C++ object file to your own code. We provide the object (compiled) code of ViBe for non-commercial applications. Under Linux, download the 32 bits zip or compressed tar file, or the 64 bits zip or compressed tar file. Details on this page.

阅读全文…

机器学习中的算法(1)-决策树模型组合之随机森林与GBDT

2011年12月17日 1 条评论

 

 

 

 

转载自LeftNotEasy的博客

 

前言:

    决策树这种算法有着很多良好的特性,比如说训练时间复杂度较低,预测的过程比较快速,模型容易展示(容易将得到的决策树做成图片展示出来)等。但是同时,单决策树又有一些不好的地方,比如说容易over-fitting,虽然有一些方法,如剪枝可以减少这种情况,但是还是不够的。

    模型组合(比如说有Boosting,Bagging等)与决策树相关的算法比较多,这些算法最终的结果是生成N(可能会有几百棵以上)棵树,这样可以大大的减少单决策树带来的毛病,有点类似于三个臭皮匠等于一个诸葛亮的做法,虽然这几百棵决策树中的每一棵都很简单(相对于C4.5这种单决策树来说),但是他们组合起来确是很强大。

    在最近几年的paper上,如iccv这种重量级的会议,iccv 09年的里面有不少的文章都是与Boosting与随机森林相关的。模型组合+决策树相关的算法有两种比较基本的形式 – 随机森林与GBDT((Gradient Boost Decision Tree),其他的比较新的模型组合+决策树的算法都是来自这两种算法的延伸。本文主要侧重于GBDT,对于随机森林只是大概提提,因为它相对比较简单。

    在看本文之前,建议先看看机器学习与数学(3)与其中引用的论文,本文中的GBDT主要基于此,而随机森林相对比较独立。

阅读全文…

分类: 新闻 标签:

KinectFusion的PCL实现

2011年12月15日 10 条评论

WillowGarage的猛士再次发力,实现了今年ISMAR上炫目的KinectFusion

The preliminary source code is currently available in our SVN repository’s trunk in the CUDA/KinFu module. Since this code is still unreleased and under active development, we won’t be able to provide support via our forums yet; however, advanced users are free to check out the code and give it a try. Be advised that this code relies heavily on the NVidia CUDA development libraries for GPU optimizations and will require a compatible GPU for best results.

Moving forward, we continue to refine and improve the system, and we are hoping to improve upon the original algorithm in order to model larger scale environments in the near future. We are targeting a stable release date to coincide with the upcoming PCL 2.0 release next year. (Please note there is no planned release in the 1.x branch.)

阅读全文…

Learning ImageMagick 5: 颜色通道操作

2011年12月14日 没有评论

一对对偶操作:分离颜色通道,合并颜色通道。

Lena2 Lena_B Lena_G Lena_R

分离颜色通道:使用-channel 和 –separete 参数

convert ..\SampleImages\Lena.png -channel R -separate Lena_R.png
convert ..\SampleImages\Lena.png -channel G -separate Lena_G.png
convert ..\SampleImages\Lena.png -channel B -separate Lena_B.png

另外一种分离方式:同时提取三个通道。

convert ..\SampleImages\Lena.png  -separate Lena_RGB_%d.png

合并颜色通道:使用-combine 参数

convert Lena_R.png Lena_G.png Lena_B.png -combine Lena2.png

分类: 新闻 标签: ,

【代发广告】Visiting Scholar Position in Carnegie Mellon University

2011年12月11日 13 条评论

Applications are invited for a full-time visiting scholar position in the Human Sensing Lab (http://humansensing.cs.cmu.edu) at Carnegie Mellon University. Applicants should have an B.A. in computer science, electrical engineering, physics or related fields, and a some expertise in COMPUTER VISION and/or machine learning. The position is for one year starting in FEBRUARY 2012 and it will cover living expenses and health care. The candidate will be responsible for developing computer vision algorithms for a MEDICAL IMAGE Problem. The project will require proficiency in Matlab, but some parts will be developed in C/C++. Knowledge of is a plus. This is a one year visiting position to strength your research CV, it is not a M.S. or PhD. program at CMU. The CV should be sent to ftorre@cs.cmu.edu.

分类: 新闻 标签:

ZZ OpenCV在iOS和x86平台上的性能测试

2011年12月8日 6 条评论

来自老杨的一篇评测。

好久没更新博客了,最近都在忙找工作,目前来看形势还不错。感觉我基本已经跟computer vision没什么关系了,面了很多公司,只有一个是视觉的,其它都是纯码农活儿了。

这学期上了个计算机架构的水课,最后的大作业我就benchmark了一下OpenCV在x86和ARM下面的性能,我的笔记本的CPU是Intel Core i7 620M,iOS测试我用的是iPod Touch,和iPhone 4是一样的CPU, Apple A4。最新的iPhone 4s和iPad2上的Apple A5没机会测,因为这俩都不能完全越狱。x86比ARM快那是肯定的,不过这个benchmark可以看出来到底差多少。

编译:在x86上我用的是64位编译的,因为Apple A4分别支持ARM v6和v7两个版本,我就分别都编译了不同的静态库。

测试:我使用了不同的数据类型,8/16/32位整形,32/64位浮点;不同的输入矩阵大小,4*4/8*8/…/256*256/512*512;不同的操作,加、乘、转置、求逆、SVD,还有一组图像处理的比较。

下面是一些比较结果:

阅读全文…

分类: 新闻 标签: , , , ,

链接大放送

2011年12月7日 4 条评论

 

两个AR的视频:一个是利用kinect的魔镜,二是“三维立体导航”

来自Andol的hand detector

Flow by Amazon,结合了Barcode和物体识别。

非常浅显易懂的tf-idf导论I, II

分类: 新闻 标签: , , ,

ZZ 自制低成本3D激光扫描测距仪(3D激光雷达),第二部分

2011年12月1日 17 条评论

来自csk的低成本3d scanner。本文的前传在此

 

1. 设备设计

核心元件原型

在第一篇文章的原理介绍[1]中,已经大致提到了本次制作的核心元件:摄像头、激光器以及进行扫描的伺服电机的 选型要求。

对于我期望的精度和性能,一般市面常见的USB VGA摄像头即可满足要求。

图:本制作使用的USB摄像头(已经拆除外壳)

阅读全文…