摘要:
Road crack detection in complex scenarios is challenged by vehicles, traffic facilities, road printed signs and fine cracks. In order to better solve these problems, a novel dense nested depth U-shaped structure for crack image segmentation network named DUCTNet is proposed. Firstly, a depth dense nested structure is designed by combining the superior performance of the Unet $++$ dense nested structure and the deep nested structure of U2Net. This structure improves the ability of the model to extract crack features in depth. Second, a novel deep competitive fusion feature extraction block is proposed. It improves the feature dissimilarity between the cracks and the background by competitive fusion. Then, a novel high-density feature fusion attention mechanism is proposed. This method enhances the contextual and sensitive information of cracks both horizontally and vertically by increasing the feature density. Finally, DUCTNet achieves the best results in comparison tests with eight state-of-the-art specialized crack segmentation networks in both self-built datasets and four public datasets. In addition, DUCTNet achieves excellent results in real road tests, which proves that DUCTNet can provide engineers and technicians with a better means of detecting road cracks.
摘要:
Rice is a crucial agricultural crop, yet it frequently suffers from various diseases, leading to decreased yields and, in severe cases, crop failure. Diseases significantly affect rice growth and yield, resulting in economic losses and food security challenges. The role of image recognition in identifying rice diseases is critical in agricultural production. It enables automated and efficient detection of rice diseases, which is essential for effective management, ensuring food security and sustainable agriculture. To address issues like background noise and edge blurring in rice disease image capture, as well as challenges in determining the optimal learning rate during the training of traditional rice disease recognition networks, a novel method based on PSOC-DRCNet is proposed for rice disease recognition.. First, tto solve the problem of background interference, Dual Mode Attention (DMA) is proposed to adaptively capture meaningful regions in rice disease images. Second, the Residual Adaptive Block (RAB) is proposed, which utilizes dimensional changes and channel attention to solve edge blur problems. Then, a Cross entropy and regularized mixed Loss function (CerLoss), is proposed to optimize the learning strategy of the model in the process of processing datasets and enhance the performance and generalization ability of the model to avoid overfitting problems. Ultimately, In response to the cumbersome problem of finding the optimal learning rate, we propose using Particle Swarm Optimization Chameleon (PSOC) to find the optimal learning rate and train the PSOC-DRCNet model on our custom dataset and compare it with other existing methods and the final average classification accuracy of PSOC-DRCNet is 93.88% with an F1 score of 0.940. We compare it with other existing methods. It is proved that the average classification accuracy of our model under hyper-parameter unification is 92.65% F1 score is 0.928. We validated the PSOC-DRCNet by conducting comparative analyses with other models and through generalization experiments and module effectiveness tests. Additionally, the practicality of PSOC-DRCNet was confirmed through its application in real-world scenarios. The methods proposed in this paper successfully enable the identification of various diseases in rice leaves, offering a practical solution for incorporating deep learning into the agricultural production process. Furthermore, these findings serve as a valuable reference for disease identification in other crops.
摘要:
Image denoising remains a classic and crucial issue in the field of image processing, significantly impacting the outcomes of subsequent image processing tasks. For instance, the denoising network depends on “noise-clean” image pairs to train network effectively. However, it is often hampered by issues such as useful information loss, low training efficiency, and poor blind denoising. To address these challenges, this study proposes a novel image denoising network that integrates the complementary strengths of model-based and learning-based approaches, specifically leveraging the capabilities of both SDDW-GAN and CHRNet. Firstly, SDDW-GAN is designed to estimate the noise distribution on the input noisy images, and a fast-smoothing noisy block sampling algorithm is proposed to extract the noise blocks in noisy images in SDDW-GAN. Secondly, a network with dual generators and dual discriminators based on W-GAN is designed to estimate the noise distribution on the input noisy images and generate noise sample pairs with the same noise distribution, which solves the problem of relying on “noise-clean” image pairs. Thirdly, CHRNet is designed to compute the mapping relationship between the double-noise samples and the single-noise samples. In order to further improve the denoising effect, the dual-channel residual attention module is proposed for fusion learning of global and local features. Experimental results show that the proposed method has a better denoising effect in complex environments and outperforms existing denoising methods. Specifically, in comparison with the stand-alone denoising methods BM3D, DnCNN, Noise2Noise, and Blind2Unblind, the proposed method improves the average peak signal-to-noise ratio (PSNR) by 0.23 dB to 0.78 dB on two benchmarking datasets crossing different noise levels. Its denoising effect is also greater than other competitive stand-alone and combination methods. The proposed method can also extend to low-light image enhancement, deblurring, and super-resolution.
摘要:
<jats:p>Tomato leaf disease control in the field of smart agriculture urgently requires attention and reinforcement. This paper proposes a method called LAFANet for image-text retrieval, which integrates image and text information for joint analysis of multimodal data, helping agricultural practitioners to provide more comprehensive and in-depth diagnostic evidence to ensure the quality and yield of tomatoes. First, we focus on six common tomato leaf disease images and text descriptions, creating a Tomato Leaf Disease Image-Text Retrieval Dataset (TLDITRD), introducing image-text retrieval into the field of tomato leaf disease retrieval. Then, utilizing ViT and BERT models, we extract detailed image features and sequences of textual features, incorporating contextual information from image-text pairs. To address errors in image-text retrieval caused by complex backgrounds, we propose Learnable Fusion Attention (LFA) to amplify the fusion of textual and image features, thereby extracting substantial semantic insights from both modalities. To delve further into the semantic connections across various modalities, we propose a False Negative Elimination-Adversarial Negative Selection (FNE-ANS) approach. This method aims to identify adversarial negative instances that specifically target false negatives within the triplet function, thereby imposing constraints on the model. To bolster the model’s capacity for generalization and precision, we propose Adversarial Regularization (AR). This approach involves incorporating adversarial perturbations during model training, thereby fortifying its resilience and adaptability to slight variations in input data. Experimental results show that, compared with existing ultramodern models, LAFANet outperformed existing models on TLDITRD dataset, with top1, top5, and top10 reaching 83.3% and 90.0%, and top1, top5, and top10 reaching 80.3%, 93.7%, and 96.3%. LAFANet offers fresh technical backing and algorithmic insights for the retrieval of tomato leaf disease through image-text correlation.</jats:p>
摘要:
Combining an object-detection algorithm with an unmanned aerial vehicle (UAV) can accelerate the detection of road cracks. To address the difficulties of intricate crack morphology, similar color to the road, and small crack area, this paper describes a UAV road crack object-detection algorithm using MUENet. The MUENet is primarily comprised of a main and auxiliary dual-path module (MADPM), an uneven fusion structure with transpose and inception convolutions (TI-UFS) and a E-SimOTA strategy. First, the MADPM is proposed to efficiently extract the essential morphological features of cracks. Subsequently, the TI-UFS is proposed to explore potential crack color characteristics. Finally, the E-SimOTA strategy accurately differentiates different types of cracks and accelerates network training convergence. The experimental results demonstrate that MUENet has the double benefits of precision and speed on a self-built dataset of UAV near-far scene images (UNFSI). This object-detection algorithm is more adaptable to crack objects than other mainstream object-detection algorithms.
摘要:
Tomato leaf diseases have a significant impact on tomato cultivation modernization. Object detection is an important technique for disease prevention since it may collect reliable disease information. Tomato leaf diseases occur in a variety of environments, which can lead to intraclass variability and interclass similarity in the disease. Tomato plants are commonly planted in soil. When a disease occurs near the leaf's edge, the soil backdrop in the image tends to interfere with the infected region. These problems can make tomato detection challenging. In this paper, we propose a precise image-based tomato leaf disease detection approach using PLPNet. First, a perceptual adaptive convolution module is proposed. It can effectively extract the disease's defining characteristics. Second, a location reinforcement attention mechanism is proposed at the neck of the network. It suppresses the interference of the soil backdrop and prevents extraneous information from accessing the network's feature fusion phase. Then, a proximity feature aggregation network with switchable atrous convolution and deconvolution is proposed by combining the mechanisms of secondary observation and feature consistency. The network solves the problem of disease interclass similarities. Finally, the experimental results show that PLPNet achieved 94.5% mean average precision with 50% thresholds (mAP50), 54.4% average recall (AR), and 25.45 frames per second (FPS) on a self-built dataset. The model is more accurate and specific for the detection of tomato leaf diseases than other popular detectors. Our proposed method may effectively improve conventional tomato leaf disease detection and provide modern tomato cultivation management with reference experience.
关键词:
apple leaf diseases;complex background;tiny-object detection;HSSNet;TTALDD-4
摘要:
Apple leaf diseases are one of the most important factors that reduce apple quality and yield. The object detection technology based on deep learning can detect diseases in a timely manner and help automate disease control, thereby reducing economic losses. In the natural environment, tiny apple leaf disease targets (a resolution is less than 32 × 32 pixel(2)) are easily overlooked. To address the problems of complex background interference, difficult detection of tiny targets and biased detection of prediction boxes that exist in standard detectors, in this paper, we constructed a tiny target dataset TTALDD-4 containing four types of diseases, which include Alternaria leaf spot, Frogeye leaf spot, Grey spot and Rust, and proposed the HSSNet detector based on the YOLOv7-tiny benchmark for professional detection of apple leaf disease tiny targets. Firstly, the H-SimAM attention mechanism is proposed to focus on the foreground lesions in the complex background of the image. Secondly, SP-BiFormer Block is proposed to enhance the ability of the model to perceive tiny targets of leaf diseases. Finally, we use the SIOU loss to improve the case of prediction box bias. The experimental results show that HSSNet achieves 85.04% mAP (mean average precision), 67.53% AR (average recall), and 83 FPS (frames per second). Compared with other standard detectors, HSSNet maintains high real-time detection speed with higher detection accuracy. This provides a reference for the automated control of apple leaf diseases.
作者机构:
[Chenxi Deng] Ecological Livable College, Hunan University of Environment and Biology, Hengyang, China;[Guoxiong Zhou; Yiqing Cai] School of Computer and Information Engineering, Central South University of Forestry & Technology, Changsha, China
会议名称:
2023 International Conference on Advances in Electrical Engineering and Computer Applications (AEECA)
会议时间:
18 August 2023
会议地点:
Dalian, China
会议论文集名称:
2023 International Conference on Advances in Electrical Engineering and Computer Applications (AEECA)
摘要:
China is a powerful country with vast territory and abundant animal resources. At present, there are more than 400 kinds of national protected animals, and there are 1999 man-made reserves. Wildlife resources have important strategic significance. Real time detection and identification of wildlife is the main work of managers. Based on the research background of wildlife image detection, this paper analyzes and summarizes the traditional image detection methods, and proposes a wildlife detection and automatic recognition method based on fast r-cnn. In this paper, the tensorflow framework is downloaded and configured, and the collected wildlife images are processed and annotated. Secondly, the preprocessed image is reconstructed according to the format of voc2007 data set, and then the wildlife image is detected and recognized by using GPU based fast r-cnn framework. The experimental results show that this method can achieve fast detection and accurate recognition of wildlife.
作者机构:
[Zhao, Di] School of Information Science and Engineering, Hunan First Normal University, Hunan, Changsha;410205, China;[Liu, Jing] Department of Information Engineering, Hunan Vocational College of Engineering, Hunan, Changsha;410151, China;[Zhou, Guo-Xiong] School of Computer and Information Engineering, Central South Forestry University, Hunan, Changsha
通讯机构:
[Jing Liu] D;Department of Information Engineering, Hunan Vocational College of Engineering, Changsha, China
作者机构:
[Liu, Tao] Cent South Univ Forestry & Technol, Coll Civil Engn, Changsha, Hunan, Peoples R China.;[Zhang, Liangji; Cai, Chuang; Zhou, Guoxiong; Cai, Weiwei] Cent South Univ Forestry & Technol, Coll Comp & Informat Engn, Changsha, Hunan, Peoples R China.;[Li, Liujun] Univ Missouri, Dept Civil Architectural & Environm Engn, Rolla, MO 65401 USA.
摘要:
Crack is the external expression form of potential safety risks in bridge construction. Currently, automatic detection and segmentation of bridge cracks remains the top priority of civil engineers. With the development of image segmentation techniques based on convolutional neural networks, new opportunities emerge in bridge crack detection. Traditional bridge crack detection methods are vulnerable to complex background and small cracks, which is difficult to achieve effective segmentation. This study presents a bridge crack segmentation method based on a densely connected U-Net network (BC-DUnet) with a background elimination module and cross-attention mechanism. First, a dense connected feature extraction model (DCFEM) integrating the advantages of DenseNet is proposed, which can effectively enhance the main feature information of small cracks. Second, the background elimination module (BEM) is proposed, which can filter the excess information by assigning different weights to retain the main feature information of the crack. Finally, a cross-attention mechanism (CAM) is proposed to enhance the capture of long-term dependent information and further improve the pixel-level representation of the model. Finally, 98.18% of the Pixel Accuracy was obtained by comparing experiments with traditional networks such as FCN and Unet, and the IOU value was increased by 14.12% and 4.04% over FCN and Unet, respectively. In our non-traditional networks such as HU-ResNet and F U N-4s, SAM-DUnet has better and higher accuracy and generalization is not prone to overfitting. The BC-DUnet network proposed here can eliminate the influence of complex background on the segmentation accuracy of bridge cracks, improve the detection efficiency of bridge cracks, reduce the detection cost, and have practical application value.
摘要:
Inspired by that bird sound has various frequency distributions and continuous time-varying properties, a novel method is proposed for the classification of bird sound based on continuous frame sequence and spectrogram-frame linear network (SFLN). In order to form a continuous frame sequence as the standard input for SFLN, a sliding window algorithm of short frame length is suitable for differentiate the Mel-spectrogram of bird sound. The vertical 3D filter in the linear layer moves linearly along the continuous frame and cover its full frequency band. The weight is initialized to a Gaussian distribution to attenuate the high-and low-frequency noise, thereby extracting the long-and short-term features of the continuous frame of the bird sound. Finally, the GRU network is connected and used as a classifier to directly output the prediction results. Four kinds of bird sound from the xeno-canto website are tested to evaluate the influences of different parameters of sliding window on the effect of SFLN-based classification. In the comparison experiment, the mean average precision (MAP) achieves the highest value of 0.97.
摘要:
In this paper, a method for detecting rapid rice disease based on FCM-KM and Faster R-CNN fusion is proposed to address various problems with the rice disease images, such as noise, blurred image edge, large background interference and low detection accuracy. Firstly, the method uses a two-dimensional filtering mask combined with a weighted multilevel median filter (2DFM-AMMF) for noise reduction, and uses a faster two-dimensional Otsu threshold segmentation algorithm (Faster 2D-Otsu) to reduce the interference of complex background with the detection of target blade in the image. Then the dynamic population firefly algorithm based on the chaos theory as well as the maximum and minimum distance algorithm is applied for optimization of the K-Means clustering algorithm (FCM-KM) to determine the optimal clustering class k value while addressing the tendency of the algorithm to fall into the local optimum problem. Combined with the R-CNN algorithm for the identification of rice diseases, FCM-KM analysis is conducted to determine the different sizes of the Faster R-CNN target frame. As revealed by the application results of 3010 images, the accuracy and time required for detection of rice blast, bacterial blight and blight were 96.71%/0.65s, 97.53%/0.82s and 98.26%/0.53s, respectively, indicating clearly that the method is more capable of detecting rice diseases and improving the identification accuracy of Faster R-CNN algorithm, while reducing the time required for identification.