Client Background
- Client Name: A leading AI Data Center firm in the USA
- Industry Type: Engineering Analytics & AI Solutions
- Products & Services:
- Engineering document analysis
- Computer vision-based automation
- AI-driven validation systems
About Client:
- The client works with complex engineering datasets including:
- Electrical panel images
- Architectural drawings
- Multi-page technical PDFs
- Their workflows required high accuracy, scalability, and automation
- Objective:
- Reduce manual inspection
- Improve validation accuracy
- Automate structured data extraction
The Problem
- Parking management challenges:
- Manual monitoring of parking slots
- No real-time occupancy tracking
- Inefficient space utilization
- Difficulty in:
- Detecting vehicle presence
- Identifying misaligned parking
Our Solution
- Developed a real-time parking detection system using computer vision
- Key capabilities:
- Vehicle detection using YOLO
- Parking slot mapping using coordinates
- Occupancy detection (free vs occupied)
- Alignment detection (proper vs misaligned)
Solution Architecture
- Frame capture module:
- Live camera or video input
- Detection module:
- YOLO-based vehicle detection
- Processing:
- Slot boundary comparison
- IoU and inside-percentage calculations
- Logic layer:
- Occupancy classification
- State stabilization
- Visualization:
- OpenCV overlays including boxes, masks, and warnings
Deliverables
- Real-time parking detection system
- Slot occupancy tracking
- Visualization output (OpenCV-based)
- Configurable environment setup
Technical Challenges
- Accurate detection in real-time video streams
- Defining parking boundaries precisely
- Handling misaligned vehicles
- Ensuring stable detection over time
Solutions
- Polygon-based slot mapping
- Time-based stabilization logic
- IoU and percentage-based calculations
- Dynamic overlays for visualization
Business Impact
- Improved parking space utilization
- Enabled real-time monitoring
- Reduced manual supervision
- Scalable for smart city applications
Tech Stack (Across All Projects)
Frameworks
- FastAPI
- Modal (serverless compute)
Languages / Techniques
- Python
- Computer vision
- ETL pipelines
- OCR processing
Models Used
- YOLOv8 (object detection)
- OCR models (Gemini, DeepSeek tested)
- Vision Language Models (experimental)
Skills Used
- Data processing and automation
- Computer vision
- System design
- Backend development
- Problem-solving
Databases
- JSON outputs and structured datasets
- No external database in most pipelines
Cloud / Infrastructure
- Modal serverless compute
- Local and containerized (Docker) setups
Overall Business Impact
- Automated previously manual engineering workflows
- Improved accuracy and reduced human error
- Enabled scalable processing of large datasets
- Delivered structured and actionable insights
- Reduced operational time and effort significantly





















