王丹丹,张来斌,储胜利,等. 基于SDA-YOLO和双目视觉的钻杆识别定位方法[J]. 石油钻采工艺,2026,48(2):139-147, 176. DOI: 10.13639/j.odpt.202509027
引用本文: 王丹丹,张来斌,储胜利,等. 基于SDA-YOLO和双目视觉的钻杆识别定位方法[J]. 石油钻采工艺,2026,48(2):139-147, 176. DOI: 10.13639/j.odpt.202509027
WANG Dandan, ZHANG Laibin, CHU Shengli, et al. Drill pipe recognition and localization method based on SDA-YOLO and stereo vision[J]. Oil Drilling & Production Technology, 2026, 48(2): 139-147, 176. DOI: 10.13639/j.odpt.202509027
Citation: WANG Dandan, ZHANG Laibin, CHU Shengli, et al. Drill pipe recognition and localization method based on SDA-YOLO and stereo vision[J]. Oil Drilling & Production Technology, 2026, 48(2): 139-147, 176. DOI: 10.13639/j.odpt.202509027

基于SDA-YOLO和双目视觉的钻杆识别定位方法

Drill pipe recognition and localization method based on SDA-YOLO and stereo vision

  • 摘要: 针对复杂井场环境下钻杆目标存在跨尺度特征难以精确感知,制约钻台自动化装备稳定运行的问题,提出了一种基于SDA-YOLO和双目视觉的钻杆识别与定位方法。在轻量化YOLO v8n-seg基础上,引入多层次特征融合模块(SDI)与注意力尺度序列融合模块(ASF-YOLO),并设计Inner-CIOU损失函数强化对小尺度目标的约束,构建了SDA-YOLO模型,为验证方法有效性,构建了包含1600张真实井场钻杆图像的数据集,并按8∶2的比例划分为训练集与测试集。实验结果表明,该模型的精确率、召回率、平均精度均值分别达到了99.7%、93.5%、98.0%,较基准模型YOLO v8n-seg分别提升了13.2、9.2和12.3个百分点,显著增强了目标钻杆的识别分割精度。此外,基于双目视觉理论,结合双目标定与立体校正,引入RAFT-Stereo立体匹配算法开展稠密视差估计,实现了钻杆内径圆心的高精度三维定位,平均绝对误差为7.82 mm。研究表明,该方法能够为钻台自动对中、智能上卸扣及协同搬运提供可靠的视觉感知支撑,进而提升钻台作业自动化和智能化水平。

     

    Abstract: In the context of complex drilling site environments, where the drilling rod targets exhibit cross-scale characteristics that are difficult to perceive accurately, thus limiting the stable operation of automated drilling rigs, a method for drill pipe recognition and localization based on SDA-YOLO and stereo vision is proposed. Building on the lightweight YOLO v8n-seg model, a multi-level feature fusion module (SDI) and an attention-based scale sequence fusion module (ASF-YOLO) are introduced, and an Inner-CIOU loss function is designed to strengthen the constraints on small-scale targets. The SDA-YOLO model is constructed. To validate the effectiveness of the method, a dataset consisting of 1600 real-field drill pipe images is created, which is divided into training and testing sets at a ratio of 8∶2. Experimental results show that the model achieves precision, recall, and mean average precision values of 99.7%, 93.5%, and 98.0%, respectively, which represent improvements of 13.2, 9.2, and 12.3 percentage points compared to the baseline YOLO v8n-seg model, significantly enhancing the recognition and segmentation accuracy of target drill pipes. Moreover, based on stereo vision theory, integrated with dual calibration and stereo rectification, the RAFT-Stereo stereo matching algorithm is introduced to perform dense disparity estimation, enabling high-precision three-dimensional localization of the drill pipe's inner diameter center, with an average absolute error of 7.82 mm. The study demonstrates that the proposed method can provide reliable visual perception support for automatic centering of drilling rig, intelligent make-up and break-out, and collaborative handling, thus promoting the automation and intelligence level of drilling rig operations.

     

/

返回文章
返回