
Dr. Farshid PirahanSiah

I am currently seeking opportunities in Berlin, Germany, focusing on the development of Large Language Models and advanced AI systems. My extensive experience in R&D for computer vision and multimodal applications positions me to effectively lead and innovate in this field.

Seeking Position in Berlin, Germany only:

  • Technical Lead in R&D for Computer Vision
  • End-to-end generative AI pipeline for cloud-based solutions
  • Multimodal LLMs for image processing applications
  • Large Vision Models (LVMs) for edge computing

Development of Generative AI Pipelines:

  • Engineered end-to-end, cloud-based generative AI solutions, overseeing the entire pipeline from data ingestion and model training to deployment and scaling.
  • Expertise in Multimodal Large Language Models (LLMs):
    • Specialized in integrating multimodal LLMs for image processing applications, enhancing system capabilities to interpret and analyze visual and textual data simultaneously.
  • Innovations with Large Vision Models (LVMs):
    • Developed and optimized Large Vision Models for edge computing, ensuring efficient processing and responsiveness in IoT devices.

Technical Expertise in Large Language Models (LLMs) and AI Development:

  • Multimodal RAG Systems: Led the development of Retrieval-Augmented Generation (RAG) applications integrating text, image, and structured data, enhancing multimodal interaction capabilities.
  • Advanced AI Pipelines: Engineered end-to-end solutions for generative AI, leveraging cloud-based architectures to deploy scalable and efficient AI systems.
  • Deep Learning Implementation: Proficient in implementing complex deep learning models, with extensive use of libraries such as PyTorch, OpenAI's GPT models, and langchain for sophisticated text and image processing tasks.
  • Data Handling and Processing: Experienced in manipulating large-scale datasets, implementing custom extraction and partition techniques for PDF data integration, utilizing Python's robust libraries like PyPDF2 and pytesseract for OCR functionalities.
  • Optimization Techniques: Applied advanced machine learning techniques including hyper-parameter tuning, quantization, and model compression to enhance performance and efficiency on target hardware platforms, particularly in edge computing scenarios.
  • AI Model Deployment: Skilled in deploying AI models using Docker, managing environments with dependencies including langchain, unstructured, PyPDF2, and various OpenAI services, ensuring smooth transition from development to production.
  • Research and Development: Authored comprehensive documentation and guides, effectively summarizing research findings and technical processes, demonstrated through detailed GitHub repositories and Jupyter notebooks.
  • AI-Powered Summarization: Developed capabilities for summarizing diverse data elements (text, tables, images) using AI-driven approaches, significantly improving information accessibility and user engagement.
  • Community Contribution and Collaboration: Actively engaged in community forums and collaborative projects, contributing to open-source projects and providing innovative solutions to complex problems in the AI space.

I am eager to bring my expertise to a dynamic team in Berlin, where I can contribute to groundbreaking projects and further advance the field of artificial intelligence.

Patents and Publications

  • Patents: A METHOD FOR AUGMENTING A PLURALITY OF FACE IMAGES - 2021
    • The present invention relates to a method for increasing data for face analysis in video surveillance.
    • WO2021060971A1
  • Patents: A METHOD FOR DETECTING A MOVING VEHICLE - 2021
    • The present invention relates to a method for detecting a moving vehicle.
    • WO2021107761
  • Patents: System and method for providing advertisement contents based on facial analysis - 2020
    • Invented an algorithm, methods, and system for advanced facial attribute detection, leading to improvements in advertising systems.
    • WO2020141969A2 WIPO (PCT)
  • Book Chapter: Camera Calibration and Video Stabilization for Robot Localization, Springer, 2021.
  • Authored over 16 publications in books, journals, and conferences globally.

My works on LLMs:

  • Image Processing GPT
  • MindMap about LLMs & LLMOps
  • Code for a chat app with OpenRouter's AI 🚀 It uses asyncio and aiohttp for seamless conversations and manages interactions with a smart queue.
  • Fine-tune LLMs
  • Microsoft AI Lab: RAG Workflow with Azure AI
    • Lab Focus: Hands-on RAG workflow development using Azure AI Studio and Prompt Flow.
    • Skills Acquired: Mastery in LLMOps, Azure AI Studio usage, and Prompt Flow integration.
    • Tools Used: GitHub Codespaces, Visual Studio Code, Azure AI & ML Studio, Azure Portal.
    • Outcome: Successfully developed and deployed "Contoso Chat", enhancing skills in scalable AI solution development.
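
The queue-based chat pattern mentioned in the OpenRouter item above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: `fake_send` is a hypothetical stand-in for the aiohttp request so the example runs without network access, and the names `worker`, `chat`, and `run_chat` are assumptions made for this sketch.

```python
import asyncio


async def fake_send(prompt: str) -> str:
    """Hypothetical stand-in for an aiohttp POST to a chat endpoint."""
    await asyncio.sleep(0)  # yield control, as a real network call would
    return f"echo: {prompt}"


async def worker(queue: asyncio.Queue, replies: dict) -> None:
    """Pull (index, prompt) pairs off the queue until cancelled."""
    while True:
        index, prompt = await queue.get()
        try:
            replies[index] = await fake_send(prompt)
        finally:
            queue.task_done()


async def chat(prompts, n_workers: int = 2) -> list:
    """Send all prompts through a shared queue and keep the input order."""
    queue = asyncio.Queue()
    replies = {}
    for item in enumerate(prompts):
        queue.put_nowait(item)
    workers = [asyncio.create_task(worker(queue, replies))
               for _ in range(n_workers)]
    await queue.join()          # wait until every prompt has been handled
    for task in workers:
        task.cancel()           # workers loop forever, so stop them explicitly
    await asyncio.gather(*workers, return_exceptions=True)
    return [replies[i] for i in range(len(prompts))]


def run_chat(prompts, n_workers: int = 2) -> list:
    """Synchronous entry point for scripts and notebooks without a running loop."""
    return asyncio.run(chat(prompts, n_workers))
```

Replies are stored by index, so the returned list follows the order of the prompts even when workers finish out of order. In a real client, `fake_send` would presumably be replaced by a request made through an `aiohttp.ClientSession`.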


Image Processing Test for C++

My GitHub repository on advanced programming with modern C++23 for image processing:

https://github.com/pirahansiah/cvtest

The first function is int func_image_info(cv::Mat src, cv::Mat &dst /*output*/). This function shows information about an image, such as its size, histogram, ...
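
To illustrate the kind of information such a function might report, here is a small pure-Python sketch. It is not the repository's C++ code: the image is assumed to be a grayscale list of rows with values 0 to 255 (a stand-in for cv::Mat), and the name `image_info` and the `bins` parameter are hypothetical.

```python
def image_info(pixels, bins=4):
    """Return size, value range, and a coarse histogram of a grayscale image.

    `pixels` is assumed to be a list of rows, each a list of ints in 0..255.
    """
    height = len(pixels)
    width = len(pixels[0]) if height else 0
    histogram = [0] * bins
    bin_width = 256 / bins
    flat = [value for row in pixels for value in row]
    for value in flat:
        # Clamp so that 255 always lands in the last bin.
        histogram[min(int(value / bin_width), bins - 1)] += 1
    return {
        "width": width,
        "height": height,
        "min": min(flat) if flat else None,
        "max": max(flat) if flat else None,
        "histogram": histogram,
    }
```

For example, `image_info([[0, 64], [128, 255]])` reports a 2 x 2 image with one pixel in each of the four bins.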

YouTube link for OpenCV: https://www.youtube.com/watch?v=gK1ybsWOqhs

Multi-camera calibration

Stereo camera calibration

Computer Vision, Deep Learning, AI Metaverse

I have 6+ years of experience as a computer vision research engineer at three multinational companies on two continents, strengthened by an academic background with a Master’s and a PhD in Computer Science (Computer Vision). My expertise includes roles as Technical Lead R&D and Software Specialist for Image Processing in Medical Devices, as well as Computer Vision with Machine Learning (Object Detection, Video Tracking), IoT, and Robotics. I am experienced in designing algorithms for Image Thresholding, Optical Flow, Camera Calibration, and Stereo Vision. I also have a track record of creating effective metrics, building end-to-end pipelines, and writing production-level code with OpenCV and deep learning frameworks (Caffe, TensorFlow, PyTorch).

I am interested in the Metaverse and medicine, in particular 3D camera calibration for extended reality headsets in the Metaverse.
I have experience in computer vision, deep learning, and robotics.
I am familiar with IoT and edge computing, medical devices, and cloud-based solutions (AWS).


Platform for metaverse
AR/VR Frameworks Engineer For New Application Paradigm

Camera Calibration

Last Updated 29.Jan.2022

Geometric camera calibration, also referred to as camera re-sectioning, estimates the parameters of a lens and image sensor of an image or video camera. These parameters can be used to correct for lens distortion, measure the size of an object in world units, or determine the location of the camera in a scene. These tasks are used in applications such as machine vision to detect and measure objects. They are also used in robotics, navigation systems, and 3-D scene reconstruction. Without any knowledge of the calibration of the cameras, it is impossible to do better than projective reconstruction (MathWorks).

Non-intrusive scene measurement tasks, such as 3D reconstruction, object inspection, target or self-localization or scene mapping require a calibrated camera model (Orghidan et al. 2011). Camera calibration is the process of approximating the parameters of a pinhole camera model (Tsai 1987; Stein 1995; Heikkila & Silven 1997) of a given photograph or video.
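
As a rough illustration of what the pinhole parameters do, the sketch below projects a 3D point given in camera coordinates onto the image plane. It assumes the common parameterization with focal lengths `fx`, `fy`, principal point `cx`, `cy`, and a single radial distortion coefficient `k1`; these names and the function `project_point` are assumptions for this example and do not come from the cited papers.

```python
def project_point(point_3d, fx, fy, cx, cy, k1=0.0):
    """Project a 3D point (camera coordinates) to pixel coordinates.

    Uses an ideal pinhole model plus one radial distortion term:
    x_d = x * (1 + k1 * r^2), with r^2 = x^2 + y^2 in normalized coordinates.
    """
    X, Y, Z = point_3d
    if Z <= 0:
        raise ValueError("point must lie in front of the camera (Z > 0)")
    x, y = X / Z, Y / Z                 # perspective division
    r2 = x * x + y * y
    scale = 1.0 + k1 * r2               # radial distortion factor
    u = fx * x * scale + cx             # apply intrinsics
    v = fy * y * scale + cy
    return (u, v)
```

A point on the optical axis lands on the principal point regardless of distortion, while off-axis points shift outward for positive `k1`. Calibration is the inverse problem: estimating `fx`, `fy`, `cx`, `cy`, and the distortion terms from observed projections of known points.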

Camera self-calibration, also known as auto-calibration, does not rely on a calibration reference object. Three-dimensional reconstruction and motion estimation are two fundamental tasks in computer vision (Kaehler & Bradski 2016). In both tasks, camera calibration is an essential step that bridges the 2D imaging plane and 3D space. For the past decade, camera calibration has been heavily investigated in the fields of computer vision and optics (Anuar et al. 2015; Garg & Deep 2015; Hong et al. 2015; Jia et al. 2015). Maybank and Faugeras (1992) introduced the concept of camera self-calibration. However, self-calibration is nonlinear and highly sensitive to noise. It can be improved by using active vision, where specific camera motions are designed, such as pure rotation or orthogonal translations (Wang et al. 2004). For example, Hartley proposed using pure rotation to compute the infinite homography and then calibrate the camera linearly (Hartley & Zisserman 2003). However, the constraints on these specific motions are too strong to satisfy in practice, which hinders wider application (Lei et al. 2004). For example, it is difficult to perform pure rotation around the camera’s optical center, even with a rotation platform, because the optical center is difficult to locate and even more difficult to align with the rotation center of the platform. Furthermore, some researchers have tried to improve self-calibration using additional constraints, such as the module constraint and the loop constraint (Courchay et al. 2012). Another category of calibration methods is based on a specific calibration rig or on scene constraints (Liming et al. 2013).
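
The pure-rotation case mentioned above can be written out briefly. The notation here is the standard textbook one and is an assumption of this note rather than something taken from the original text: K is the intrinsic matrix, R the rotation between two views, and ω the image of the absolute conic.

```latex
% Two views related by a pure rotation R about the optical center:
% corresponding image points x, x' are related by the infinite homography.
\[
  \mathbf{x}' \simeq H_\infty \mathbf{x},
  \qquad
  H_\infty = K R K^{-1}.
\]
% With \omega = K^{-\top} K^{-1} and H_\infty normalized to unit determinant,
% the rotation cancels, giving constraints that are linear in the entries of \omega:
\[
  H_\infty^{-\top}\, \omega \, H_\infty^{-1}
  = K^{-\top} R R^{\top} K^{-1}
  = \omega .
\]
```

Once ω has been estimated from one or more such homographies, K can be recovered from it by Cholesky factorization, which is why this route is described as linear.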

The first step in camera calibration is corner detection. Based on my research, the calibration pattern image plays an important role in the whole calibration process.

  1. Camera calibration for multi-modal robot vision based on image quality assessment https://www.researchgate.net/profile/Farshid-Pirahansiah/publication/288174690_Camera_calibration_for_multi-modal_robot_vision_based_on_image_quality_assessment/links/5735bc2908aea45ee83c999e/Camera-calibration-for-multi-modal-robot-vision-based-on-image-quality-assessment.pdf

  2. Pattern image significance for camera calibration https://ieeexplore.ieee.org/abstract/document/8305440

  3. Camera Calibration and Video Stabilization Framework for Robot Localization https://link.springer.com/chapter/10.1007/978-3-030-74540-0_12

  • CV_metaverse
    • 3D_multi_camera_calibration
      • corner_Detection
        • cornerDetection.ipynb
      • auto multi camera calibration

Top source code:

- cornerDetection.ipynb
    
    - It uses several preprocessing and postprocessing steps to enhance the corner detection used by camera calibration.

    - 3D multi-camera calibration requires detecting and setting points for all cameras together.

    - If the calibration pattern images are of poor quality (blurred, ...), they need to be enhanced first; the corner points are then detected and used for the calibration process.
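
One way to decide whether a pattern image needs enhancement before corner detection is a simple sharpness score. The sketch below uses the variance of a 4-neighbour Laplacian, a common blur heuristic that is swapped in here as an assumption; the notebook's actual quality measure is not stated above, and the function names and the threshold value are hypothetical.

```python
def laplacian_variance(pixels):
    """Variance of the 4-neighbour Laplacian over the interior pixels.

    `pixels` is assumed to be a grayscale image as a list of rows of numbers.
    Low values suggest few sharp edges, i.e. a possibly blurred image.
    """
    height = len(pixels)
    width = len(pixels[0]) if height else 0
    responses = []
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            responses.append(
                pixels[y - 1][x] + pixels[y + 1][x]
                + pixels[y][x - 1] + pixels[y][x + 1]
                - 4 * pixels[y][x]
            )
    if not responses:
        return 0.0
    mean = sum(responses) / len(responses)
    return sum((r - mean) ** 2 for r in responses) / len(responses)


def needs_enhancement(pixels, threshold=100.0):
    """Flag an image as too blurry; the threshold is a hypothetical default."""
    return laplacian_variance(pixels) < threshold
```

A uniform image scores 0 and is flagged, while a crisp checkerboard scores very high and passes. In practice the threshold would need tuning per camera and pattern.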

#computervision #AI #objectdetection #objecttracking #ml #research #CNN #gans #convolutionalneuralnetworks #vr #reinforcementlearning #mlops #aiforbusiness #science #researcher #phd #cameracalibration #opticalflow #videostablization #humanoidrobot #localization #3dSLAM #reconstruction #pointcloud #mixedreality #edgecomputing #raspberrypi #intelstick #googlecoral #jetsonnano #nvidiavgpu #tensorflowjs #pytorch #opencv #aikit #caffee #DIGITS #c++ #python #ubuntu #farshidpirahansiah #tiziran.com #farshid #pirahansiah #robotics #MultiCameraMultiClassMultiObjectTracking #deeplearning #machinelearning #artificialintelligence #tensorflow #3dvision #sterovision #depthmap #RCNN #machinevision #imageprocessing #patternrecognition #compiler #RISC-V #RNN #fullStackDeepLearning #productinnovation #patents #TensorRT #ApacheTVM #TFLite #PyTorchmobile #dockers #gRPC #RESTAPIs #GraphQL #EnablingEfficient #high-performance #Accelerators #Optimization #AR/VR #SingleObjecttracking #SOT #MultiObjecttracking #MOT #MultiTargetTracking #MTT #MultiClassMultiObjecttracking #MCMOT #MCMCMOT #video #innovation #learning #datascience #TensorFlow.js #CoreML #MLkit #DataDog #NewRelic #AmazonCloudWatch #GAN

Farshid PirahanSiah, PhD.'s Projects

awesome-llm

Awesome-LLM: a curated list of Large Language Model

cd4ml-scenarios

Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshops

chatdev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

contoso-chat

This sample has the full End2End process of creating RAG application with Prompt Flow and AI Studio. It includes GPT 3.5 Turbo LLM application code, evaluations, deployment automation with AZD CLI, GitHub actions for evaluation and deployment and intent mapping for multiple LLM task mapping.

cv-ml-pipline

Computer Vision with Machine Learning Pipeline: Docker, AWS, Kubernetes, TensorFlow, Seldon, Kubeflow, ...

cvtest

Unit Test, Integration Test, System Test, Acceptance Test for Computer Vision and Deep Learning

fairmot

A simple baseline for one-shot multi-object tracking

farshid

farshid pirahansiah workshop computer vision

opencv

OpenCV 3, Visual C++ 2015, Windows x64, computer vision, image processing, webcam, video, motion, frame, edge, vector image processing with OpenCV 3 & C++. Find Faces, Modify Videos and Photos Automatically, Identify, Count & Measure, Realtime Augmented Reality Technology, An Introduction to Image Processing, Tutorial: Real-Time Object Tracking Using OpenCV, Face Features Detection System, Fast Object Tracking in C++ using OpenCV, How to install OpenCV and Create Sample Project in Visual Studio, Camera calibration With OpenCV, Chessboard or asymmetrical circle pattern, Installing OpenCV 3.2.0 with Visual Studio 2015 and configuring OpenCV project, OpenCV Stereo Vision, Machine Learning, Getting Started with Neural Network, Pattern Recognition and Application, Computer Vision Real-time Pattern Recognition using C++, Deep Neural Networks for Speech and Image Processing, Introduction to Segmentation, Deep Learning, Computer Vision & Machine Learning, Pattern Recognition, Camera Calibration, Optical Flow, Humanoid Robot, Image Processing, iOS developer, Augmented Reality, C++, Java, Matlab. Keywords: digital image processing, OpenCV C++ computer program tutorial, augmented virtual reality (augmented reality), deep machine learning, computer vision with C++ programming language, JSON, machine vision, OpenCV stereo camera calibration, optical flow, video analysis, Image Processing (IP) [OpenCV], Intelligent Systems, Deep Learning, Artificial Intelligence (AI), www.tiziran.com, image processing, robotics, mobile programming

opencv34

OpenCV 3.4 Visual Studio 2015 without CUDA

opencv4

Deep Learning for Computer Vision; TensorFlow, Caffe, OpenCV 4, Visual Studio 2017 (C++), Windows 10 64-bit; https://www.youtube.com/tiziran
