Shivendra Upadhyay’s Post

Sr Manager : Data Science, Genrative AI, LLM, NLP, ChatBots,LLMops ,Semantic Search, Consulting

1mo

Learn how an image-text multi-modality model can perform image classification, image retrieval, and image captioning #llms #llmops #image #opencv #chatgpt4 #gemini #aiml https://lnkd.in/gJFh9Ruf

How Does an Image-Text Multimodal Foundation Model Work

towardsdatascience.com

To view or add a comment, sign in

More Relevant Posts

Towards Data Science

630,954 followers
1mo Edited
Report this post
Learn how an image-text multi-modality model can perform image classification, image retrieval, and image captioning - by Yi Wei

How Does an Image-Text Foundation Model Work

towardsdatascience.com
Like Comment
To view or add a comment, sign in
Praveen Reddy

Data Engineer (AWS , Azure ) | ML , DL , NLP | GenAI | Flutter
2mo
Report this post
Practiced several scenarios like (text to image, image to text, audio to text, text summarisation , image captioning ) using Open Source Hugging Face Models. Repo Link :: https://lnkd.in/ggyuacMu
Like Comment
To view or add a comment, sign in
Ajinkya Kolhe

AI, LLMs & Deep Learning
2mo
Report this post
Simple Image Generation Pipeline End to End. Going through Huggingface Diffusion Course & it allows you to get upto speed in Deep Learning very quickly. End to End Pipelines are important because Deep Learning is a iterative & experimentation heavy domain. If we don't do End to End pipeline, we will never move forward. So a simple pipeline and we will customize it further to fine tune Stable Diffusion. Just in 8 lectures you can reach until coding your own Midjourney. https://lnkd.in/dcJ2Ue9p #deeplearning #huggingface #generativeai

Simple Image Generation Pipeline with Diffusers

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Pyresearch

1,476 followers
7mo
Report this post
Exciting News! Learn all about Image Classification in our latest YouTube video! In this video, we dive deep into the world of image classification, exploring the fascinating technology that powers it. Whether you're a tech enthusiast or just curious about how computers 'see' images, this is a must-watch! https://lnkd.in/db6N-zfz... Don't miss out! Hit the link above, subscribe to our channel, and join us on this visual journey. Let's unravel the mysteries of image classification together! #ImageClassification #TechExploration #YouTubeLearning #ComputerVision

Image classification

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Rimaz Rauf

Digital Marketer • AI Consultant • Content Creator • Coder
6mo
Report this post
Let's learn how to create this trending profile image using 4 FREE AI Tools. #aiimagegenerator #aitools #adobefirefly #dalle #chatgpt

I Tried to Create this With All 4 Image Generators And The WINNER Is!

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Masamune Ishihara

Computer Engineering Undergrad
1mo
Report this post
🚀 Created a presentation titled "Understand Stable Diffusion from Code."🌟 Rather than dealing in theory of LDM, I'll explain through working code how the images are generated. I'm also planning to write a blog about this in the future. 💡 Highlights: - A clear breakdown of the 4-step process in Latent Diffusion Models. - Animation of the denoising process. - All the codes in the slides are executable. I also publish the code to create animations. 🔗 Check it out here: https://lnkd.in/dNdcXTuX #StableDiffusion #MachineLearning #AI #ImageGeneration #DeepLearning #TechInnovation #DataScience

Understand Stable Diffusion from Code

masaishi.github.io
Like Comment
To view or add a comment, sign in
Incubity

3,974 followers
1mo
Report this post
Discover the power of Multimodal Knowledge Graphs in integrating structured knowledge from text, images, sound, and video. Learn about their construction, applications in fields like image classification and visual question answering, and the challenges they present. Explore how these advanced graphs enable a comprehensive understanding of complex concepts and relationships across various domains. Read the full article here: https://lnkd.in/gtAY3iYN #generativeai #knowledgegraph
Like Comment
To view or add a comment, sign in
Shivangi Verma

--frontend developer| learning DSA| 4 ⭐ hackerank C++
4mo
Report this post
🎉 Exciting Project Alert! 🎉 🤚🔍 Delighted to share my latest project on Hand Detection using state-of-the-art computer vision techniques! 🖥️👀 In this project, I developed a robust hand detection system leveraging deep learning models and advanced image processing algorithms. 💡💻 🔍 Key Features: 1️⃣ Accurate Hand Localization: Implemented a multi-stage pipeline for precise hand detection in various environmental conditions. 2️⃣ Real-time Performance: Optimized the model architecture and inference process to achieve high-speed processing suitable for real-time applications. 3️⃣ Robustness: Designed the system to handle challenges such as occlusions, varying lighting conditions, and diverse hand shapes and sizes. 4️⃣ Integration Ready: Developed APIs for seamless integration with existing applications and frameworks. 🚀 Applications: Gesture Recognition: Enabling intuitive interaction with digital interfaces. Human-Computer Interaction: Facilitating natural communication between humans and machines. Augmented Reality: Enhancing immersive experiences through hand gestures in AR applications. 🌟 Technologies Used: Python, OpenCV, TensorFlow Convolutional Neural Networks (CNNs) Image Processing Techniques 📈 Results: The developed system achieved an impressive accuracy rate of over 90% on standard benchmark datasets. Additionally, it demonstrated exceptional performance in real-world scenarios, showcasing its potential for practical applications. 🤝 Let's Connect! Excited to share more about this project and explore potential collaborations in computer vision, AI, and related fields. Feel free to reach out and let's innovate together! 🚀 #ComputerVision #AI #HandDetection #DeepLearning #MachineLearning #ImageProcessing #ArtificialIntelligence #Python #OpenCV #TensorFlow #GestureRecognition #AR #CV #Innovation #Technology #Engineering #LinkedIn #ProjectShowcase

17.03.2024_18.05.50_REC

screenrec.com

2 Comments
Like Comment
To view or add a comment, sign in
Yogesh Jadhav

Data Enthusiast | Data Analyst | Data Science | ML/DL/AI | Analytics | Visualization | ETL | UI/UX | NFT | Power Apps | IT | Content Writer | Jobs/Recruitment | Quoran | Follow for more
8mo
Report this post
📰 High-Dynamic Video Generation: Researchers have developed a new approach called PixelDance that improves video generation by incorporating image instructions along with text instructions. Current methods focusing on text-to-video generation often produce video clips with minimal motions. PixelDance, based on diffusion models, sets a higher standard for video synthesis by creating videos with complex scenes and intricate motions. #ArtificialIntelligence #MachineLearning #ComputerVision #VideoGeneration

📰 High-Dynamic Video Generation: Researchers have developed a new approach called PixelDance that improves video generation by incorporating image instructions along with text instructions. Current methods focusing on text-to-video generation often produce video clips with minimal motions. PixelDance, based on diffusion models, sets a higher standard for video synthesis by creating videos wi...

arxiv.org
Like Comment
To view or add a comment, sign in
Oleksandr S.

User Experience / Product Designer · Transportation & Mobility Solutions · End-to-End B2B, B2C, Web & IOS & Android App
7mo
Report this post
Just found a cool site that shows how language models work, from start to finish. It's got explanations and formulas and all that tech stuff. Works best on a computer. https://bbycroft.net/llm #languagemodels #techinsights #aiexplained l #innovationinaction #DigitalTrends #userexperiencedesign #TechTalks #designthinking #smarttechsolutions

LLM Visualization

bbycroft.net
Like Comment
To view or add a comment, sign in

8,879 followers

843 Posts

View Profile Follow

Shivendra Upadhyay’s Post

More Relevant Posts

Simple Image Generation Pipeline with Diffusers

https://www.youtube.com/

Image classification

https://www.youtube.com/

I Tried to Create this With All 4 Image Generators And The WINNER Is!

https://www.youtube.com/

Explore topics