Advanced Tutorials - Removing Objects - Lesson 7 - Find Mask
Contextual Attention

An object removal from image system using deep learning image segmentation and inpainting techniques.

Object removal from image involves two separate tasks, object detection and object removal.

Deep learning is a machine learning technique that teaches computers to do what comes naturally to humans: learn by example. Deep learning is a key technology behind driverless cars, enabling them to recognize a stop sign, or to distinguish a pedestrian from a lamppost.

Python is a high-level, general-purpose programming language. Its design philosophy emphasizes code readability with the use of significant indentation. Python is dynamically typed and garbage-collected. It supports multiple programming paradigms, including structured, object-oriented and functional programming.

The first task is handled by the user drawing a bounding box around an object of interest to be removed. We could then remove all pixels inside the bounding box, but this could lead to loss of valuable information from the pixels in the box that are not part of the object. Instead Mask-RCNN, a state of the art instance segmentation model is used to get the exact mask of the object.

Filling in the image is done using DeepFillv2, an image inpainting generative adversarial network which employs a gated convolution system.

The result is a complete image with the object removed.

Downloading Contextual Attentions class:
https://drive.google.com/file/d/1Cz4acHXQtCj_Tvyy-WwYEZv3y7peugPB/view?usp=sharing

دورة تعليمية متقدمة في إزالة الأغراض من الصور - الدرس 7 - العثور على الغرض

إزالة كائن من نظام الصور باستخدام تقنيات تجزئة الصور والتعلم العميق.

تتضمن إزالة الكائن من الصورة مهمتين منفصلتين، اكتشاف الكائن وإزالة الكائن.

يتم التعامل مع المهمة الأولى من خلال قيام المستخدم برسم مربع محيط حول كائن محل الاهتمام المراد إزالته. يمكننا بعد ذلك إزالة جميع وحدات البكسل الموجودة داخل المربع المحيط، لكن هذا قد يؤدي إلى فقدان معلومات قيمة من وحدات البكسل الموجودة في المربع والتي لا تشكل جزءًا من الكائن. بدلاً من ذلك، يتم استخدام نموذج تجزئة مثيل متطور للحصول على القناع الدقيق للكائن.

بايثون هي لغة برمجة عالية المستوى للأغراض العامة. تؤكد فلسفة التصميم الخاصة بها على سهولة قراءة التعليمات البرمجية باستخدام مسافة بادئة كبيرة. تتم كتابة لغة بايثون ديناميكيًا ويتم جمع البيانات المهملة فيها. وهو يدعم نماذج برمجة متعددة، بما في ذلك البرمجة المنظمة والموجهة للكائنات والوظيفية.

التعلم العميق هو أحد تقنيات التعلم الآلي التي تعلم أجهزة الكمبيوتر القيام بما هو طبيعي بالنسبة للبشر: التعلم بالقدوة. يعد التعلم العميق تقنية أساسية وراء السيارات ذاتية القيادة، حيث تمكنها من التعرف على إشارة التوقف، أو التمييز بين المشاة وعمود الإنارة.

يتم ملء الصورة ، وهي صورة ترسم شبكة خصومة توليدية.

والنتيجة هي صورة كاملة مع إزالة الكائن.

تحميل صف المستخدم في عملية الترميم - التركيز على السياق-
https://drive.google.com/file/d/1Cz4acHXQtCj_Tvyy-WwYEZv3y7peugPB/view?usp=sharing