Facebook open-sources Detectron, object detection software implementing research up to 2017 | Blognone

By: lew

on 23 January 2018 - 18:40

Topics: Artificial Intelligence

Facebook's research unit has released Detectron, object detection software that implements popular research such as Faster R-CNN and RPN, as well as newer work such as Mask R-CNN and RetinaNet, both published in 2017.

The software is built on Caffe2 and ships its own Detectron-specific operators, so anyone who already has Caffe2 installed may need to update it to support these new operators. It also requires a machine with a GPU; it cannot run on CPU.

It is released under the Apache License 2.0. Facebook says the architecture should make it easy to add more models in the future, and it welcomes patches from anyone interested.

Source - Ross Girshick, GitHub: facebookresearch/Detectron

Comments

By: whitebigbird

on 23 January 2018 - 22:04 #1029937

This may sound silly, but I genuinely want to know: can it tell Uncle Tu apart from a cutout picture of Uncle Tu when the two stand side by side? Or will it see both as person?

By: sapjunior

on 24 January 2018 - 00:37 #1029950 Reply to:1029937

With the pretrained models from ImageNet or COCO, both would come out as person.

By: whitebigbird

on 24 January 2018 - 09:06 #1029970 Reply to:1029950

Thank you. Suppose an infrared map were available: would it help distinguish a real person from a photograph?

By: EThaiZone

on 24 January 2018 - 09:47 #1029975 Reply to:1029970

It would help, but you would have to retrain the model on new data and modify the network used for training and prediction.

By: Hadakung

on 24 January 2018 - 12:42 #1030013 Reply to:1029970

The key point is that the infrared 3D data has to be accurate too. I use an Intel RealSense R300 and it is rather poor, but I am waiting to buy the D435, which should be much better.
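
To illustrate why depth accuracy matters here, below is a minimal sketch of a simple heuristic. It is not anything Detectron provides and not the retraining approach suggested above. It assumes a detector has already returned a person bounding box and that a depth map aligned to the image is available. The function names `depth_spread` and `looks_flat`, the 3 cm threshold, and the sample depth values are all hypothetical. The idea is that a flat picture facing the camera has almost no depth variation inside its box, while a real person does. Sensor noise larger than the threshold would defeat the check, and a cutout tilted away from the camera would also show depth spread, so this is only a rough first filter.

```python
from statistics import pstdev


def depth_spread(depth_map, box):
    """Population standard deviation of valid depth readings inside a box.

    depth_map: list of rows of depth values in metres; 0 marks a missing reading.
    box: (x0, y0, x1, y1) in pixel coordinates, with x1 and y1 exclusive.
    Returns None when fewer than two valid readings fall inside the box.
    """
    x0, y0, x1, y1 = box
    values = [d for row in depth_map[y0:y1] for d in row[x0:x1] if d > 0]
    if len(values) < 2:
        return None
    return pstdev(values)


def looks_flat(depth_map, box, threshold_m=0.03):
    """True when the depth inside the box varies less than threshold_m.

    threshold_m is a hypothetical value; it would need tuning per sensor.
    """
    spread = depth_spread(depth_map, box)
    return spread is not None and spread < threshold_m


# Made-up 3x3 depth patches in metres, for illustration only.
flat_patch = [
    [2.00, 2.00, 2.01],
    [2.00, 2.01, 2.00],
    [2.01, 2.00, 2.00],
]
person_patch = [
    [2.10, 1.95, 2.10],
    [2.05, 1.90, 2.05],
    [2.10, 2.00, 2.10],
]

print(looks_flat(flat_patch, (0, 0, 3, 3)))    # True: almost no depth variation
print(looks_flat(person_patch, (0, 0, 3, 3)))  # False: several cm of relief
```

A sensor whose noise is on the order of the threshold would make both patches look alike, which is one way to read the point about needing an accurate depth camera.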

By: whitebigbird

on 24 January 2018 - 21:32 #1030103 Reply to:1029970

Thank you both.
