Holdings: Computer vision projects with PyTorch

Computer vision projects with PyTorch: design and develop production-grade ,odels

Saved in:

Bibliographic Details
Main Authors:	Kulkarni, Akshay (Author), Shivananda, Adarsha (Author), Sharma, Nitin Ranjan (Author)
Format:	Book
Language:	English
Published:	New York Apress [2022]
Subjects:	bicssc bisacsh Python (Computer program language) Artificial intelligence Machine learning Hardcover, Softcover / Informatik, EDV/Informatik
Item Description:	Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO, data augmentation, and other regularization techniques used in the industry. And then it gives you a quick overview of the PyTorch libraries used in the book. After that, it takes you through the implementation of image classification problems, object detection techniques, and transfer learning while training and running inference. The book covers image segmentation and an anomaly detection model. And it discusses the fundamentals of video processing for computer vision tasks putting images into videos. The book concludes with an explanation of the complete model building process for deep learning frameworks using optimized techniques with highlights on model AI explainability.After reading this book, you will be able to build your own computer vision projects using transfer learning and PyTorch.What You Will Learn- Solve problems in computer vision with PyTorch.- Implement transfer learning and perform image classification, object detection, image segmentation, and other computer vision applications- Design and develop production-grade computer vision projects for real-world industry problems- Interpret computer vision models and solve business problems Chapter 1: Building Blocks of Computer VisionChapter Goal: The chapter will start with the basic concepts of Computer Vision. We will cover theoretical aspects that lays the foundation for the upcoming hands-on projects on Computer Vision. No of pages:35Sub -Topics1. Overview of Computer Vision2. Understanding AlexNET, Convolutional Neural Network and receptive fields3. Understanding advanced concepts like RESNETS and inception network4. Discuss how usage of batch normalization, drop outs, data augmentation techniques help solve data insufficiency in deep learning models5. Introduction to PyTorch for Computer Vision models; Chapter 2: Building Image Classification ModelChapter Goal: The chapter will discuss about image classification model along with data augmentation techniques.No of pages: 40Sub - Topics 1. Data preparation for image classification problem2. Data augmentation techniques3. Setting up model architecture with explanation4. - Train and run inference for the Image Classification model5. Discuss Grouped Convolution, Dilated Convolution and transposed convolution and their application; Chapter 3: Building Object Detection ModelChapter Goal: This chapter will explain the core difference between simple classification model to detecting objects in an image. We will understand optimizing loss function to get the final object localized and detected. The chapter will take through some concepts of the existing models and how to fine tune them.No of pages: 30Sub - Topics: 1. Exploring Object Detection concepts like FastRCNN, YOLO2. Explaining annotations and examples of how annotations are used in Object Detection3. Explaining loss function components4. Building Object Detection model, using transfer learning technique5. Running inference on fine-tuned model; Chapter 4: Building Image Segmentation ModelChapter Goal: The chapter will define how single or multiple images can be segmented in an image. - How a user can define a loss function and develop a model to segregate image outlines. No of pages: 35Sub - Topics: 1. Concepts on how segmentation works on Images2. Explaining custom pre trained models3. Defining and explaining loss functions4. Implementing & fine-tuning Image Segmentation model ; Chapter 5: Image Similarity & Image based SearchChapter Goal: The chapter deals with the explanation of how the image similarity works and how use cases move around this concept. No of pages: 25Sub - Topics: 1. Defining Image similarity and anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4. Providing solutions for Detecting Image similarities; Chapter 6: Image Anomaly DetectionChapter Goal: The chapter deals with the explanation of how anomalies from images can be detected and use-cases around it.No of pages: 20Sub - Topics: 1. Defining anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4. - Detecting anomalies on images ; Chapter 7: Video Processing Applications using PyTorchChapter Goal: This chapter deals with various mechanism of video processing techniques. This chapter will help one to deal with untangling the complexities of video with series of images placed in time sequence. Concepts of RNN/LSTM/GRU will be discussed to solve real time use-cases on videos.No of pages: 50Sub - Topics: 1. Setting up concepts of time dependent feature set2. Extrapolating images to videos3. Setting up concepts for video processing using Convolutional Neural Networks4. Defining the dataset and the loss function5. Defining the model6. Training the model and run inference ; Chapter 8: Super-resolution through Upscaling & GANChapter Goal: This chapter deals with foundations on Generative Adversarial Networks in the field of computer vision. - The concepts will be extrapolated with an use-case to how it is being used in super resolution (Enhancing Image Quality)No of pages: 30Sub - Topics: 1. Establish the concept of upscaling in images1. Foundations of VAE and GAN in images2. Setting up codes in GAN for super resolution3. Using the concept to understand data augmentation using GAN; Chapter 9: Body Posture DetectionChapter Goal: This chapter will establish the concept of multiple body posture detection. It will have the code encompassed the detection and multiple methods around posture detection applications.No of pages: 30Sub - Topics: 1. Discussing top-down and bottom-up approach to detect persons2. Discuss open pose detection model to establish body pose3. Use of segmentation technique to detect body pose; Chapter 10: Explainable AI for Computer Vision using GRADCAMChapter Goal: This chapter deals with foundations on how a deep learning model results can be explained. - An overview of GRADCAM and how the concepts help someone explaining a Computer Vision model will be discussed in abundance.No of pages: 15Sub - Topics: 1. Revisit the concepts of explain-able AI2. Deep learning explainers to CV classification model3. Setting up concepts of GRADCAM4. Implementing how Computer Vision models can be interpreted by GRADCAM
Physical Description:	xvi, 346 Seiten Illustrationen 557 grams
ISBN:	9781484282724

Staff View

MARC


LEADER	00000nam a2200000 c 4500
001	BV048446729
003	DE-604
005	20221007
007	t
008	220829s2022 a\|\|\| \|\|\|\| 00\|\|\| eng d
020			\|a 9781484282724 \|9 978-1-4842-8272-4
024	3		\|a 9781484282724
035			\|a (OCoLC)1347212967
035			\|a (DE-599)BVBBV048446729
040			\|a DE-604 \|b ger \|e rda
041	0		\|a eng
049			\|a DE-29T
100	1		\|a Kulkarni, Akshay \|e Verfasser \|0 (DE-588)1177937689 \|4 aut
245	1	0	\|a Computer vision projects with PyTorch \|b design and develop production-grade ,odels \|c Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma
264		1	\|a New York \|b Apress \|c [2022]
300			\|a xvi, 346 Seiten \|b Illustrationen \|c 557 grams
336			\|b txt \|2 rdacontent
337			\|b n \|2 rdamedia
338			\|b nc \|2 rdacarrier
500			\|a Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO, data augmentation, and other regularization techniques used in the industry. And then it gives you a quick overview of the PyTorch libraries used in the book. After that, it takes you through the implementation of image classification problems, object detection techniques, and transfer learning while training and running inference. The book covers image segmentation and an anomaly detection model. And it discusses the fundamentals of video processing for computer vision tasks putting images into videos. The book concludes with an explanation of the complete model building process for deep learning frameworks using optimized techniques with highlights on model AI explainability.After reading this book, you will be able to build your own computer vision projects using transfer learning and PyTorch.What You Will Learn- Solve problems in computer vision with PyTorch.- Implement transfer learning and perform image classification, object detection, image segmentation, and other computer vision applications- Design and develop production-grade computer vision projects for real-world industry problems- Interpret computer vision models and solve business problems
500			\|a Chapter 1: Building Blocks of Computer VisionChapter Goal: The chapter will start with the basic concepts of Computer Vision. We will cover theoretical aspects that lays the foundation for the upcoming hands-on projects on Computer Vision. No of pages:35Sub -Topics1. Overview of Computer Vision2. Understanding AlexNET, Convolutional Neural Network and receptive fields3. Understanding advanced concepts like RESNETS and inception network4. Discuss how usage of batch normalization, drop outs, data augmentation techniques help solve data insufficiency in deep learning models5. Introduction to PyTorch for Computer Vision models; Chapter 2: Building Image Classification ModelChapter Goal: The chapter will discuss about image classification model along with data augmentation techniques.No of pages: 40Sub - Topics 1. Data preparation for image classification problem2. Data augmentation techniques3. Setting up model architecture with explanation4.
500			\|a - Train and run inference for the Image Classification model5. Discuss Grouped Convolution, Dilated Convolution and transposed convolution and their application; Chapter 3: Building Object Detection ModelChapter Goal: This chapter will explain the core difference between simple classification model to detecting objects in an image. We will understand optimizing loss function to get the final object localized and detected. The chapter will take through some concepts of the existing models and how to fine tune them.No of pages: 30Sub - Topics: 1. Exploring Object Detection concepts like FastRCNN, YOLO2. Explaining annotations and examples of how annotations are used in Object Detection3. Explaining loss function components4. Building Object Detection model, using transfer learning technique5. Running inference on fine-tuned model; Chapter 4: Building Image Segmentation ModelChapter Goal: The chapter will define how single or multiple images can be segmented in an image.
500			\|a - How a user can define a loss function and develop a model to segregate image outlines. No of pages: 35Sub - Topics: 1. Concepts on how segmentation works on Images2. Explaining custom pre trained models3. Defining and explaining loss functions4. Implementing & fine-tuning Image Segmentation model ; Chapter 5: Image Similarity & Image based SearchChapter Goal: The chapter deals with the explanation of how the image similarity works and how use cases move around this concept. No of pages: 25Sub - Topics: 1. Defining Image similarity and anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4. Providing solutions for Detecting Image similarities; Chapter 6: Image Anomaly DetectionChapter Goal: The chapter deals with the explanation of how anomalies from images can be detected and use-cases around it.No of pages: 20Sub - Topics: 1. Defining anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4.
500			\|a - Detecting anomalies on images ; Chapter 7: Video Processing Applications using PyTorchChapter Goal: This chapter deals with various mechanism of video processing techniques. This chapter will help one to deal with untangling the complexities of video with series of images placed in time sequence. Concepts of RNN/LSTM/GRU will be discussed to solve real time use-cases on videos.No of pages: 50Sub - Topics: 1. Setting up concepts of time dependent feature set2. Extrapolating images to videos3. Setting up concepts for video processing using Convolutional Neural Networks4. Defining the dataset and the loss function5. Defining the model6. Training the model and run inference ; Chapter 8: Super-resolution through Upscaling & GANChapter Goal: This chapter deals with foundations on Generative Adversarial Networks in the field of computer vision.
500			\|a - The concepts will be extrapolated with an use-case to how it is being used in super resolution (Enhancing Image Quality)No of pages: 30Sub - Topics: 1. Establish the concept of upscaling in images1. Foundations of VAE and GAN in images2. Setting up codes in GAN for super resolution3. Using the concept to understand data augmentation using GAN; Chapter 9: Body Posture DetectionChapter Goal: This chapter will establish the concept of multiple body posture detection. It will have the code encompassed the detection and multiple methods around posture detection applications.No of pages: 30Sub - Topics: 1. Discussing top-down and bottom-up approach to detect persons2. Discuss open pose detection model to establish body pose3. Use of segmentation technique to detect body pose; Chapter 10: Explainable AI for Computer Vision using GRADCAMChapter Goal: This chapter deals with foundations on how a deep learning model results can be explained.
500			\|a - An overview of GRADCAM and how the concepts help someone explaining a Computer Vision model will be discussed in abundance.No of pages: 15Sub - Topics: 1. Revisit the concepts of explain-able AI2. Deep learning explainers to CV classification model3. Setting up concepts of GRADCAM4. Implementing how Computer Vision models can be interpreted by GRADCAM
650		4	\|a bicssc
650		4	\|a bicssc
650		4	\|a bisacsh
650		4	\|a bisacsh
650		4	\|a Python (Computer program language)
650		4	\|a Artificial intelligence
650		4	\|a Machine learning
653			\|a Hardcover, Softcover / Informatik, EDV/Informatik
700	1		\|a Shivananda, Adarsha \|e Verfasser \|0 (DE-588)1177938383 \|4 aut
700	1		\|a Sharma, Nitin Ranjan \|e Verfasser \|4 aut
776	0	8	\|i Erscheint auch als \|n Online-Ausgabe \|z 978-1-4842-8273-1
999			\|a oai:aleph.bib-bvb.de:BVB01-033824950

Record in the Search Index

_version_	1804184375665885184
adam_txt
any_adam_object
any_adam_object_boolean
author	Kulkarni, Akshay Shivananda, Adarsha Sharma, Nitin Ranjan
author_GND	(DE-588)1177937689 (DE-588)1177938383
author_facet	Kulkarni, Akshay Shivananda, Adarsha Sharma, Nitin Ranjan
author_role	aut aut aut
author_sort	Kulkarni, Akshay
author_variant	a k ak a s as n r s nr nrs
building	Verbundindex
bvnumber	BV048446729
ctrlnum	(OCoLC)1347212967 (DE-599)BVBBV048446729
format	Book
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>08026nam a2200481 c 4500</leader><controlfield tag="001">BV048446729</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20221007 </controlfield><controlfield tag="007">t</controlfield><controlfield tag="008">220829s2022 a\|\|\| \|\|\|\| 00\|\|\| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781484282724</subfield><subfield code="9">978-1-4842-8272-4</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9781484282724</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1347212967</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV048446729</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Kulkarni, Akshay</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1177937689</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Computer vision projects with PyTorch</subfield><subfield code="b">design and develop production-grade ,odels</subfield><subfield code="c">Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">New York</subfield><subfield code="b">Apress</subfield><subfield code="c">[2022]</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xvi, 346 Seiten</subfield><subfield code="b">Illustrationen</subfield><subfield code="c">557 grams</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO, data augmentation, and other regularization techniques used in the industry. And then it gives you a quick overview of the PyTorch libraries used in the book. After that, it takes you through the implementation of image classification problems, object detection techniques, and transfer learning while training and running inference. The book covers image segmentation and an anomaly detection model. And it discusses the fundamentals of video processing for computer vision tasks putting images into videos. The book concludes with an explanation of the complete model building process for deep learning frameworks using optimized techniques with highlights on model AI explainability.After reading this book, you will be able to build your own computer vision projects using transfer learning and PyTorch.What You Will Learn- Solve problems in computer vision with PyTorch.- Implement transfer learning and perform image classification, object detection, image segmentation, and other computer vision applications- Design and develop production-grade computer vision projects for real-world industry problems- Interpret computer vision models and solve business problems</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Chapter 1: Building Blocks of Computer VisionChapter Goal: The chapter will start with the basic concepts of Computer Vision. We will cover theoretical aspects that lays the foundation for the upcoming hands-on projects on Computer Vision. No of pages:35Sub -Topics1. Overview of Computer Vision2. Understanding AlexNET, Convolutional Neural Network and receptive fields3. Understanding advanced concepts like RESNETS and inception network4. Discuss how usage of batch normalization, drop outs, data augmentation techniques help solve data insufficiency in deep learning models5. Introduction to PyTorch for Computer Vision models; Chapter 2: Building Image Classification ModelChapter Goal: The chapter will discuss about image classification model along with data augmentation techniques.No of pages: 40Sub - Topics 1. Data preparation for image classification problem2. Data augmentation techniques3. Setting up model architecture with explanation4. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Train and run inference for the Image Classification model5. Discuss Grouped Convolution, Dilated Convolution and transposed convolution and their application; Chapter 3: Building Object Detection ModelChapter Goal: This chapter will explain the core difference between simple classification model to detecting objects in an image. We will understand optimizing loss function to get the final object localized and detected. The chapter will take through some concepts of the existing models and how to fine tune them.No of pages: 30Sub - Topics: 1. Exploring Object Detection concepts like FastRCNN, YOLO2. Explaining annotations and examples of how annotations are used in Object Detection3. Explaining loss function components4. Building Object Detection model, using transfer learning technique5. Running inference on fine-tuned model; Chapter 4: Building Image Segmentation ModelChapter Goal: The chapter will define how single or multiple images can be segmented in an image. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - How a user can define a loss function and develop a model to segregate image outlines. No of pages: 35Sub - Topics: 1. Concepts on how segmentation works on Images2. Explaining custom pre trained models3. Defining and explaining loss functions4. Implementing & fine-tuning Image Segmentation model ; Chapter 5: Image Similarity & Image based SearchChapter Goal: The chapter deals with the explanation of how the image similarity works and how use cases move around this concept. No of pages: 25Sub - Topics: 1. Defining Image similarity and anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4. Providing solutions for Detecting Image similarities; Chapter 6: Image Anomaly DetectionChapter Goal: The chapter deals with the explanation of how anomalies from images can be detected and use-cases around it.No of pages: 20Sub - Topics: 1. Defining anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - Detecting anomalies on images ; Chapter 7: Video Processing Applications using PyTorchChapter Goal: This chapter deals with various mechanism of video processing techniques. This chapter will help one to deal with untangling the complexities of video with series of images placed in time sequence. Concepts of RNN/LSTM/GRU will be discussed to solve real time use-cases on videos.No of pages: 50Sub - Topics: 1. Setting up concepts of time dependent feature set2. Extrapolating images to videos3. Setting up concepts for video processing using Convolutional Neural Networks4. Defining the dataset and the loss function5. Defining the model6. Training the model and run inference ; Chapter 8: Super-resolution through Upscaling & GANChapter Goal: This chapter deals with foundations on Generative Adversarial Networks in the field of computer vision. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - The concepts will be extrapolated with an use-case to how it is being used in super resolution (Enhancing Image Quality)No of pages: 30Sub - Topics: 1. Establish the concept of upscaling in images1. Foundations of VAE and GAN in images2. Setting up codes in GAN for super resolution3. Using the concept to understand data augmentation using GAN; Chapter 9: Body Posture DetectionChapter Goal: This chapter will establish the concept of multiple body posture detection. It will have the code encompassed the detection and multiple methods around posture detection applications.No of pages: 30Sub - Topics: 1. Discussing top-down and bottom-up approach to detect persons2. Discuss open pose detection model to establish body pose3. Use of segmentation technique to detect body pose; Chapter 10: Explainable AI for Computer Vision using GRADCAMChapter Goal: This chapter deals with foundations on how a deep learning model results can be explained. </subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a"> - An overview of GRADCAM and how the concepts help someone explaining a Computer Vision model will be discussed in abundance.No of pages: 15Sub - Topics: 1. Revisit the concepts of explain-able AI2. Deep learning explainers to CV classification model3. Setting up concepts of GRADCAM4. Implementing how Computer Vision models can be interpreted by GRADCAM</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Python (Computer program language)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Artificial intelligence</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Machine learning</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Hardcover, Softcover / Informatik, EDV/Informatik</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Shivananda, Adarsha</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)1177938383</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sharma, Nitin Ranjan</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-1-4842-8273-1</subfield></datafield><datafield tag="999" ind1=" " ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-033824950</subfield></datafield></record></collection>
id	DE-604.BV048446729
illustrated	Illustrated
index_date	2024-07-03T20:29:26Z
indexdate	2024-07-10T09:38:21Z
institution	BVB
isbn	9781484282724
language	English
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-033824950
oclc_num	1347212967
open_access_boolean
owner	DE-29T
owner_facet	DE-29T
physical	xvi, 346 Seiten Illustrationen 557 grams
publishDate	2022
publishDateSearch	2022
publishDateSort	2022
publisher	Apress
record_format	marc
spelling	Kulkarni, Akshay Verfasser (DE-588)1177937689 aut Computer vision projects with PyTorch design and develop production-grade ,odels Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma New York Apress [2022] xvi, 346 Seiten Illustrationen 557 grams txt rdacontent n rdamedia nc rdacarrier Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO, data augmentation, and other regularization techniques used in the industry. And then it gives you a quick overview of the PyTorch libraries used in the book. After that, it takes you through the implementation of image classification problems, object detection techniques, and transfer learning while training and running inference. The book covers image segmentation and an anomaly detection model. And it discusses the fundamentals of video processing for computer vision tasks putting images into videos. The book concludes with an explanation of the complete model building process for deep learning frameworks using optimized techniques with highlights on model AI explainability.After reading this book, you will be able to build your own computer vision projects using transfer learning and PyTorch.What You Will Learn- Solve problems in computer vision with PyTorch.- Implement transfer learning and perform image classification, object detection, image segmentation, and other computer vision applications- Design and develop production-grade computer vision projects for real-world industry problems- Interpret computer vision models and solve business problems Chapter 1: Building Blocks of Computer VisionChapter Goal: The chapter will start with the basic concepts of Computer Vision. We will cover theoretical aspects that lays the foundation for the upcoming hands-on projects on Computer Vision. No of pages:35Sub -Topics1. Overview of Computer Vision2. Understanding AlexNET, Convolutional Neural Network and receptive fields3. Understanding advanced concepts like RESNETS and inception network4. Discuss how usage of batch normalization, drop outs, data augmentation techniques help solve data insufficiency in deep learning models5. Introduction to PyTorch for Computer Vision models; Chapter 2: Building Image Classification ModelChapter Goal: The chapter will discuss about image classification model along with data augmentation techniques.No of pages: 40Sub - Topics 1. Data preparation for image classification problem2. Data augmentation techniques3. Setting up model architecture with explanation4. - Train and run inference for the Image Classification model5. Discuss Grouped Convolution, Dilated Convolution and transposed convolution and their application; Chapter 3: Building Object Detection ModelChapter Goal: This chapter will explain the core difference between simple classification model to detecting objects in an image. We will understand optimizing loss function to get the final object localized and detected. The chapter will take through some concepts of the existing models and how to fine tune them.No of pages: 30Sub - Topics: 1. Exploring Object Detection concepts like FastRCNN, YOLO2. Explaining annotations and examples of how annotations are used in Object Detection3. Explaining loss function components4. Building Object Detection model, using transfer learning technique5. Running inference on fine-tuned model; Chapter 4: Building Image Segmentation ModelChapter Goal: The chapter will define how single or multiple images can be segmented in an image. - How a user can define a loss function and develop a model to segregate image outlines. No of pages: 35Sub - Topics: 1. Concepts on how segmentation works on Images2. Explaining custom pre trained models3. Defining and explaining loss functions4. Implementing & fine-tuning Image Segmentation model ; Chapter 5: Image Similarity & Image based SearchChapter Goal: The chapter deals with the explanation of how the image similarity works and how use cases move around this concept. No of pages: 25Sub - Topics: 1. Defining Image similarity and anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4. Providing solutions for Detecting Image similarities; Chapter 6: Image Anomaly DetectionChapter Goal: The chapter deals with the explanation of how anomalies from images can be detected and use-cases around it.No of pages: 20Sub - Topics: 1. Defining anomaly problems for images2. Defining the datasets3. Defining the loss functions and methodologies4. - Detecting anomalies on images ; Chapter 7: Video Processing Applications using PyTorchChapter Goal: This chapter deals with various mechanism of video processing techniques. This chapter will help one to deal with untangling the complexities of video with series of images placed in time sequence. Concepts of RNN/LSTM/GRU will be discussed to solve real time use-cases on videos.No of pages: 50Sub - Topics: 1. Setting up concepts of time dependent feature set2. Extrapolating images to videos3. Setting up concepts for video processing using Convolutional Neural Networks4. Defining the dataset and the loss function5. Defining the model6. Training the model and run inference ; Chapter 8: Super-resolution through Upscaling & GANChapter Goal: This chapter deals with foundations on Generative Adversarial Networks in the field of computer vision. - The concepts will be extrapolated with an use-case to how it is being used in super resolution (Enhancing Image Quality)No of pages: 30Sub - Topics: 1. Establish the concept of upscaling in images1. Foundations of VAE and GAN in images2. Setting up codes in GAN for super resolution3. Using the concept to understand data augmentation using GAN; Chapter 9: Body Posture DetectionChapter Goal: This chapter will establish the concept of multiple body posture detection. It will have the code encompassed the detection and multiple methods around posture detection applications.No of pages: 30Sub - Topics: 1. Discussing top-down and bottom-up approach to detect persons2. Discuss open pose detection model to establish body pose3. Use of segmentation technique to detect body pose; Chapter 10: Explainable AI for Computer Vision using GRADCAMChapter Goal: This chapter deals with foundations on how a deep learning model results can be explained. - An overview of GRADCAM and how the concepts help someone explaining a Computer Vision model will be discussed in abundance.No of pages: 15Sub - Topics: 1. Revisit the concepts of explain-able AI2. Deep learning explainers to CV classification model3. Setting up concepts of GRADCAM4. Implementing how Computer Vision models can be interpreted by GRADCAM bicssc bisacsh Python (Computer program language) Artificial intelligence Machine learning Hardcover, Softcover / Informatik, EDV/Informatik Shivananda, Adarsha Verfasser (DE-588)1177938383 aut Sharma, Nitin Ranjan Verfasser aut Erscheint auch als Online-Ausgabe 978-1-4842-8273-1
spellingShingle	Kulkarni, Akshay Shivananda, Adarsha Sharma, Nitin Ranjan Computer vision projects with PyTorch design and develop production-grade ,odels bicssc bisacsh Python (Computer program language) Artificial intelligence Machine learning
title	Computer vision projects with PyTorch design and develop production-grade ,odels
title_auth	Computer vision projects with PyTorch design and develop production-grade ,odels
title_exact_search	Computer vision projects with PyTorch design and develop production-grade ,odels
title_exact_search_txtP	Computer vision projects with PyTorch design and develop production-grade ,odels
title_full	Computer vision projects with PyTorch design and develop production-grade ,odels Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma
title_fullStr	Computer vision projects with PyTorch design and develop production-grade ,odels Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma
title_full_unstemmed	Computer vision projects with PyTorch design and develop production-grade ,odels Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma
title_short	Computer vision projects with PyTorch
title_sort	computer vision projects with pytorch design and develop production grade odels
title_sub	design and develop production-grade ,odels
topic	bicssc bisacsh Python (Computer program language) Artificial intelligence Machine learning
topic_facet	bicssc bisacsh Python (Computer program language) Artificial intelligence Machine learning
work_keys_str_mv	AT kulkarniakshay computervisionprojectswithpytorchdesignanddevelopproductiongradeodels AT shivanandaadarsha computervisionprojectswithpytorchdesignanddevelopproductiongradeodels AT sharmanitinranjan computervisionprojectswithpytorchdesignanddevelopproductiongradeodels

Holdings

There is no print copy available.

Interlibrary loan Place Request Caution: Not in THWS collection!

MARC

Record in the Search Index

There is no print copy available.

Similar Items