Coder Social home page Coder Social logo

microsoft / cameratraps Goto Github PK

View Code? Open in Web Editor NEW
738.0 47.0 238.0 426.54 MB

PyTorch Wildlife: a Collaborative Deep Learning Framework for Conservation.

Home Page: https://cameratraps.readthedocs.io/en/latest/

License: MIT License

Python 1.76% Jupyter Notebook 38.02% Shell 0.05% Python 20.74% Jupyter Notebook 38.02% Shell 0.39% Dockerfile 0.02% HTML 0.37% C# 0.52% Starlark 0.10% Batchfile 0.01%
camera-traps wildlife conservation machine-learning computer-vision megadetector pytorch pytorch-wildlife

cameratraps's Introduction

image

A Collaborative Deep Learning Framework for Conservation



πŸ“£ Announcement

πŸ€œπŸ€› Collaboration with EcoAssist!

We are thrilled to announce our collaboration with EcoAssist---a powerful user interface software that enables users to directly load models from the PyTorch-Wildlife model zoo for image analysis on local computers. With EcoAssist, you can now utilize MegaDetectorV5 and the classification models---AI4GAmazonRainforest and AI4GOpossum---for automatic animal detection and identification, alongside a comprehensive suite of pre- and post-processing tools. This partnership aims to enhance the overall user experience with PyTorch-Wildlife models for a general audience. We will work closely to bring more features together for more efficient and effective wildlife analysis in the future.

πŸŽοΈπŸ’¨πŸ’¨ SMALLER, BETTER, and FASTER! MegaDetectorV6 public beta testing started!

The public beta testing for MegaDetectorV6 has officially started! In the next generation of MegaDetector, we are focusing on computational efficiency and performance. We have trained multiple new models using the latest YOLO-v9 architecture, and in the public beta testing, we will allow people to test the compact version of MegaDetectorV6 (MDv6-c). We want to make sure these models work as expected on real-world datasets.

This MDv6-c model has only one-sixth (SMALLER) of the parameters of the current MegaDetectorV5 and exhibits 12% higher recall (BETTER) on animal detection in our validation datasets. In other words, MDv6-c has significantly fewer false negatives when detecting animals, making it a more robust animal detection model than MegaDetectorV5. Furthermore, one of our testers reported that the speed of MDv6-c is at least 5 times FASTER than MegaDetectorV5 on their datasets.

Models Parameters Precision Recall
MegaDetectorV5 121M 0.96 0.73
MegaDetectroV6-c 22M 0.92 0.85

We are also working on an extra-large version of MegaDetectorV6 for optimal performance and a transformer-based model using the RT-Detr architecture to prepare ourselves for the future of transformers. These models will be available in the official release of MegaDetectorV6.

If you want to join the beta testing, please come to our discord channel and DM the admins there:

πŸŽ‰ Pytorch-Wildlife ready for citation

In addition, we have recently published a summary paper on Pytorch-Wildlife. The paper has been accepted as an oral presentation at the CV4Animals workshop at this year's CVPR. Please feel free to cite us!

πŸ› οΈ Compatibility with CUDA 12.x

The new version of PytorchWildlife uses the latest version of Pytorch (currently 2.3.1), which is compatible with CUDA 12.x.

βœ… Feature highlights (Version 1.0.2.15)

  • Added a file separation function. You can now automatically separate your files between animals and non-animals into different folders using our detection_folder_separation function. Please see the Python demo file and Jupyter demo!
  • πŸ₯³ Added Timelapse compatibility! Check the Gradio interface or notebooks.
πŸ‘‰ Click for more
  • CUDA 12.x compatibility.
  • Added Google Colab demos.
  • Added Snapshot Serengeti classification model into the model zoo.
  • Added Classification fine-tuning module.
  • Added a Docker Image for ease of installation.
  • πŸ”₯ Future highlights

    • MegaDetectorV6 with multiple model sizes for both optimized performance and low-budget devices like camera systems (Public beta testing has started!!).
    • Supervision 0.19+ and Python 3.10+ compatibility.
    • A detection model fine-tuning module to fine-tune your own detection model for Pytorch-Wildlife.
    • Direct LILA connection for more training/validation data.
    • More pretrained detection and classification models to expand the current model zoo.

    To check the full version of the roadmap with completed tasks and long term goals, please click here!.

    🐾 Introduction

    At the core of our mission is the desire to create a harmonious space where conservation scientists from all over the globe can unite. Where they're able to share, grow, use datasets and deep learning architectures for wildlife conservation. We've been inspired by the potential and capabilities of Megadetector, and we deeply value its contributions to the community. As we forge ahead with Pytorch-Wildlife, under which Megadetector now resides, please know that we remain committed to supporting, maintaining, and developing Megadetector, ensuring its continued relevance, expansion, and utility.

    Pytorch-Wildlife is pip installable:

    pip install PytorchWildlife
    

    To use the newest version of MegaDetector with all the existing functionalities, you can use our Hugging Face interface or simply load the model with Pytorch-Wildlife. The weights will be automatically downloaded:

    from PytorchWildlife.models import detection as pw_detection
    detection_model = pw_detection.MegaDetectorV5()

    For those interested in accessing the previous MegaDetector repository, which utilizes the same MegaDetector v5 model weights and was primarily developed by Dan Morris during his time at Microsoft, please visit the archive directory, or you can visit this forked repository that Dan Morris is actively maintaining.

    Tip

    If you have any questions regarding MegaDetector and Pytorch-Wildlife, please email us or join us in our discord channel:

    πŸ‘‹ Welcome to Pytorch-Wildlife Version 1.0

    PyTorch-Wildlife is a platform to create, modify, and share powerful AI conservation models. These models can be used for a variety of applications, including camera trap images, overhead images, underwater images, or bioacoustics. Your engagement with our work is greatly appreciated, and we eagerly await any feedback you may have.

    The Pytorch-Wildlife library allows users to directly load the MegaDetector v5 model weights for animal detection. We've fully refactored our codebase, prioritizing ease of use in model deployment and expansion. In addition to MegaDetector v5, Pytorch-Wildlife also accommodates a range of classification weights, such as those derived from the Amazon Rainforest dataset and the Opossum classification dataset. Explore the codebase and functionalities of Pytorch-Wildlife through our interactive HuggingFace web app or local demos and notebooks, designed to showcase the practical applications of our enhancements at PyTorchWildlife. You can find more information in our documentation.

    πŸ‘‡ Here is a brief example on how to perform detection and classification on a single image using PyTorch-wildlife

    import torch
    from PytorchWildlife.models import detection as pw_detection
    from PytorchWildlife.models import classification as pw_classification
    
    img = torch.randn((3, 1280, 1280))
    
    # Detection
    detection_model = pw_detection.MegaDetectorV5() # Model weights are automatically downloaded.
    detection_result = detection_model.single_image_detection(img)
    
    #Classification
    classification_model = pw_classification.AI4GAmazonRainforest() # Model weights are automatically downloaded.
    classification_results = classification_model.single_image_classification(img)

    βš™οΈ Install Pytorch-Wildlife

    pip install PytorchWildlife
    

    Please refer to our installation guide for more installation information.

    πŸ•΅οΈ Explore Pytorch-Wildlife and MegaDetector with our Demo User Interface

    If you want to directly try Pytorch-Wildlife with the AI models available, including MegaDetector v5, you can use our Gradio interface. This interface allows users to directly load the MegaDetector v5 model weights for animal detection. In addition, Pytorch-Wildlife also has two classification models in our initial version. One is trained from an Amazon Rainforest camera trap dataset and the other from a Galapagos opossum classification dataset (more details of these datasets will be published soon). To start, please follow the installation instructions on how to run the Gradio interface! We also provide multiple Jupyter notebooks for demonstration.

    image

    πŸ› οΈ Core Features

    What are the core components of Pytorch-Wildlife? Pytorch-core-diagram

    🌐 Unified Framework:

    Pytorch-Wildlife integrates four pivotal elements:

    β–ͺ Machine Learning Models
    β–ͺ Pre-trained Weights
    β–ͺ Datasets
    β–ͺ Utilities

    πŸ‘· Our work:

    In the provided graph, boxes outlined in red represent elements that will be added and remained fixed, while those in blue will be part of our development.

    πŸš€ Inaugural Model:

    We're kickstarting with YOLO as our first available model, complemented by pre-trained weights from MegaDetector v5. This is the same MegaDetector v5 model from the previous repository.

    πŸ“š Expandable Repository:

    As we move forward, our platform will welcome new models and pre-trained weights for camera traps and bioacoustic analysis. We're excited to host contributions from global researchers through a dedicated submission platform.

    πŸ“Š Datasets from LILA:

    Pytorch-Wildlife will also incorporate the vast datasets hosted on LILA, making it a treasure trove for conservation research.

    🧰 Versatile Utilities:

    Our set of utilities spans from visualization tools to task-specific utilities, many inherited from Megadetector.

    πŸ’» User Interface Flexibility:

    While we provide a foundational user interface, our platform is designed to inspire. We encourage researchers to craft and share their unique interfaces, and we'll list both existing and new UIs from other collaborators for the community's benefit.

    Let's shape the future of wildlife research, together! πŸ™Œ

    πŸ“ˆ Progress on core tasks

    β–ͺ️ Packaging
    • Animal detection fine-tuning
    • MegaDetectorV5 integration
    • MegaDetectorV6 integration
    • User submitted weights
    • Animal classification fine-tuning
    • Amazon Rainforest classification
    • Amazon Opossum classification
    • User submitted weights

    β–ͺ️ Utility Toolkit
    • Visualization tools
    • MegaDetector utils
    • User submitted utils

    β–ͺ️ Datasets
    • Animal Datasets
    • LILA datasets

    β–ͺ️ Accessibility
    • Basic user interface for demonstration
    • UI Dev tools
    • List of available UIs

    πŸ–ΌοΈ Examples

    Image detection using MegaDetector v5

    animal_det_1
    Credits to Universidad de los Andes, Colombia.

    Image classification with MegaDetector v5 and AI4GAmazonRainforest

    animal_clas_1
    Credits to Universidad de los Andes, Colombia.

    Opossum ID with MegaDetector v5 and AI4GOpossum

    opossum_det
    Credits to the Agency for Regulation and Control of Biosecurity and Quarantine for GalΓ‘pagos (ABG), Ecuador.

    Cite us

    @misc{hernandez2024pytorchwildlife,
          title={Pytorch-Wildlife: A Collaborative Deep Learning Framework for Conservation}, 
          author={Andres Hernandez and Zhongqi Miao and Luisa Vargas and Rahul Dodhia and Juan Lavista},
          year={2024},
          eprint={2405.12930},
          archivePrefix={arXiv},
          primaryClass={cs.CV}
    }
    

    🀝 Contributing

    This project is open to your ideas and contributions. If you want to submit a pull request, we'll have some guidelines available soon.

    We have adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact us with any additional questions or comments.

    License

    This repository is licensed with the MIT license.

    πŸ‘₯ Existing Collaborators

    The extensive collaborative efforts of Megadetector have genuinely inspired us, and we deeply value its significant contributions to the community. As we continue to advance with Pytorch-Wildlife, our commitment to delivering technical support to our existing partners on MegaDetector remains the same.

    Here we list a few of the organizations that have used MegaDetector. We're only listing organizations who have given us permission to refer to them here or have posted publicly about their use of MegaDetector.

    πŸ‘‰ Full list of organizations

    (Newly Added) TerrOΓ―ko (OCAPI platform)

    Arizona Department of Environmental Quality

    Blackbird Environmental

    Camelot

    Canadian Parks and Wilderness Society (CPAWS) Northern Alberta Chapter

    Conservation X Labs

    Czech University of Life Sciences Prague

    EcoLogic Consultants Ltd.

    EstaciΓ³n BiolΓ³gica de DoΓ±ana

    Idaho Department of Fish and Game

    Island Conservation

    Myall Lakes Dingo Project

    Point No Point Treaty Council

    Ramat Hanadiv Nature Park

    SPEA (Portuguese Society for the Study of Birds)

    Synthetaic

    Taronga Conservation Society

    The Nature Conservancy in Wyoming

    TrapTagger

    Upper Yellowstone Watershed Group

    Applied Conservation Macro Ecology Lab, University of Victoria

    Banff National Park Resource Conservation, Parks Canada(https://www.pc.gc.ca/en/pn-np/ab/banff/nature/conservation)

    Blumstein Lab, UCLA

    Borderlands Research Institute, Sul Ross State University

    Capitol Reef National Park / Utah Valley University

    Center for Biodiversity and Conservation, American Museum of Natural History

    Centre for Ecosystem Science, UNSW Sydney

    Cross-Cultural Ecology Lab, Macquarie University

    DC Cat Count, led by the Humane Rescue Alliance

    Department of Fish and Wildlife Sciences, University of Idaho

    Department of Wildlife Ecology and Conservation, University of Florida

    Ecology and Conservation of Amazonian Vertebrates Research Group, Federal University of AmapΓ‘

    Gola Forest Programma, Royal Society for the Protection of Birds (RSPB)

    Graeme Shannon's Research Group, Bangor University

    Hamaarag, The Steinhardt Museum of Natural History, Tel Aviv University

    Institut des Science de la ForΓͺt TempΓ©rΓ©e (ISFORT), UniversitΓ© du QuΓ©bec en Outaouais

    Lab of Dr. Bilal Habib, the Wildlife Institute of India

    Mammal Spatial Ecology and Conservation Lab, Washington State University

    McLoughlin Lab in Population Ecology, University of Saskatchewan

    National Wildlife Refuge System, Southwest Region, U.S. Fish & Wildlife Service

    Northern Great Plains Program, Smithsonian

    Quantitative Ecology Lab, University of Washington

    Santa Monica Mountains Recreation Area, National Park Service

    Seattle Urban Carnivore Project, Woodland Park Zoo

    Serra dos Γ“rgΓ£os National Park, ICMBio

    Snapshot USA, Smithsonian

    Wildlife Coexistence Lab, University of British Columbia

    Wildlife Research, Oregon Department of Fish and Wildlife

    Wildlife Division, Michigan Department of Natural Resources

    Department of Ecology, TU Berlin

    Ghost Cat Analytics

    Protected Areas Unit, Canadian Wildlife Service

    School of Natural Sciences, University of Tasmania (story)

    Kenai National Wildlife Refuge, U.S. Fish & Wildlife Service (story)

    Australian Wildlife Conservancy (blog, blog)

    Felidae Conservation Fund (WildePod platform) (blog post)

    Alberta Biodiversity Monitoring Institute (ABMI) (WildTrax platform) (blog post)

    Shan Shui Conservation Center (blog post) (translated blog post)

    Irvine Ranch Conservancy (story)

    Wildlife Protection Solutions (story, story)

    Road Ecology Center, University of California, Davis (Wildlife Observer Network platform)

    The Nature Conservancy in California (Animl platform)

    San Diego Zoo Wildlife Alliance (Animl R package)


    Important

    If you would like to be added to this list or have any questions regarding MegaDetector and Pytorch-Wildlife, please email us or join us in our Discord channel:

    cameratraps's People

    Contributors

    aa-hernandez avatar aa-hernandez-celis avatar aench2023 avatar agentmorris avatar amritagupta avatar arashno avatar beerys avatar bencevans avatar brianhogg avatar chrisyeh96 avatar chunting-linct avatar coreyjaskolski avatar dependabot[bot] avatar fedegonzal avatar jakob-98 avatar joejynwan avatar kant avatar louis030195 avatar luvargas2 avatar marcel-simon avatar marcelsimon avatar microsoft-github-policy-service[bot] avatar msftgits avatar oksachi avatar pflickin avatar ranjanbalappa avatar suhail14298 avatar vardhan-duvvuri avatar yangsiyu007 avatar zhmiao avatar

    Stargazers

     avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

    Watchers

     avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

    cameratraps's Issues

    Continuing the training of MegaDetector

    Hi all! That's a lot for making this fantastic work available, the collective usefulness of the tools provided is truly staggering.

    I was trying out MegaDetector and it's working really well. However, it consistently misses some smaller animals that are partially obscured in the grass (so, false negatives), and sometimes has a tendency to pick up inanimate objects like tree stumps as animals (false positives). Confidence thresholding doesn't help to weed out those false positives as it comes at a high price - basically, I'd have to set a threshold close to unity and miss on a lot of robust animal detection that way.

    I see that there are checkpoints available for the 4.1 detector version, and was wondering if there's a code snippet one can use for continuing the training?

    Alternatively (a bit of a general question, sorry), are they any good strategies to mitigate the issues above?

    google.protobuf.message.DecodeError: Error parsing message

    hello, i got some errors when run your code when refering to the file of megadetector_colab.ipynb
    ##########################
    !python run_tf_detector_batch.py megadetector_v4_1_0.pb "$images_dir" "$output_file_path" --recursive

    Then the error occurs, the messages are showed as follow:
    ...
    GPU:0 with 10672 MB memory) -> physical GPU (device: 0, name: Tesla K80, pci bus id: 0000:90:00.0, compute capability: 3.7)
    2020-09-28 13:04:05.656622: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/device:GPU:1 with 10672 MB memory) -> physical GPU (device: 1, name: Tesla K80, pci bus id: 0000:91:00.0, compute capability: 3.7)
    2020-09-28 13:04:05.657306: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/device:GPU:2 with 10672 MB memory) -> physical GPU (device: 2, name: Tesla K80, pci bus id: 0000:94:00.0, compute capability: 3.7)
    2020-09-28 13:04:05.657562: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/device:GPU:3 with 10672 MB memory) -> physical GPU (device: 3, name: Tesla K80, pci bus id: 0000:95:00.0, compute capability: 3.7)
    tf.test.is_gpu_available: True
    3597 image files found in the input directory
    TFDetector: Loading graph...
    Traceback (most recent call last):
    File "run_tf_detector_batch.py", line 375, in
    main()
    File "run_tf_detector_batch.py", line 357, in main
    n_cores=args.ncores)
    File "run_tf_detector_batch.py", line 166, in load_and_run_detector_batch
    tf_detector = TFDetector(model_file)
    File "/data1/user_datas/tianxianhao/DeepLearning/CameraTrap/CameraTraps/detection/run_tf_detector.py", line 147, in init
    detection_graph = TFDetector.__load_model(model_path)
    File "/data1/user_datas/tianxianhao/DeepLearning/CameraTrap/CameraTraps/detection/run_tf_detector.py", line 212, in __load_model
    od_graph_def.ParseFromString(serialized_graph)
    google.protobuf.message.DecodeError: Error parsing message
    ###################
    how to solve this problem? please.

    TFLite Converter Errors

    I am attempting to port the frozen .pb file to a TFLite model for use with the Raspberry Pi and the Google Coral TPU USB Coprocessor and ran into issues running the conversion. I am new to TensorFlow and, while I'm a Software Engineer, I'm using this as a user more than a developer, so bear with me here.

    The command I run and samples of the, very verbose, output are below. I know this may not be at the top of your priority list, but if it helps motivation, if this works out we could use MegaDetector on camera traps in the field to detect animals in real-time in the field, like for a predator detector I'm looking for for our farm. I'm able to crunch a detection in 45 seconds on my CPU but am eager to see what the Coral TPU can do. Thanks in advance for any help you're able to provide and I'm happy to roll up my sleeves and dig in if I can get a nudge in the right direction.

    Here's the command I run:
    tflite_convert --output_file=md_v4.1.0.pb.tflite --graph_def_file md_v4.1.0.pb --input_arrays=image_tensor --output_arrays=detection_boxes,detection_scores,detection_classes --input_shapes=1,640,480,3 --enable_v1_converter

    Traceback (most recent call last): File "c:\anaconda3\envs\py36\lib\runpy.py", line 193, in _run_module_as_main "main", mod_spec) File "c:\anaconda3\envs\py36\lib\runpy.py", line 85, in run_code exec(code, run_globals) File "C:\anaconda3\envs\py36\Scripts\tflite_convert.exe_main.py", line 7, in File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\tflite_convert.py", line 442, in main app.run(main=run_main, argv=sys.argv[:1]) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\python\platform\app.py", line 125, in run _sys.exit(main(argv)) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\tflite_convert.py", line 438, in run_main _convert_model(tflite_flags) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\tflite_convert.py", line 191, in _convert_model output_data = converter.convert() File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\lite.py", line 455, in convert **converter_kwargs) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\convert.py", line 442, in toco_convert_impl input_data.SerializeToString()) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\convert.py", line 205, in toco_convert_protos "TOCO failed. See console for info.\n%s\n%s\n" % (stdout, stderr)) tensorflow.lite.python.convert.ConverterError: TOCO failed. See console for info. 2020-12-10 21:12:31.516113: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayV3 2020-12-10 21:12:31.516570: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "WrapDatasetVariant" device_type: "CPU"') for unknown op: WrapDatasetVariant 2020-12-10 21:12:31.516902: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "WrapDatasetVariant" device_type: "GPU" host_memory_arg: "input_handle" host_memory_arg: "output_handle"') for unknown op: WrapDatasetVariant 2020-12-10 21:12:31.517337: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "UnwrapDatasetVariant" device_type: "CPU"') for unknown op: UnwrapDatasetVariant 2020-12-10 21:12:31.517673: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "UnwrapDatasetVariant" device_type: "GPU" host_memory_arg: "input_handle" host_memory_arg: "output_handle"') for unknown op: UnwrapDatasetVariant 2020-12-10 21:12:31.518258: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayV3 2020-12-10 21:12:31.518503: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayScatterV3 2020-12-10 21:12:31.518752: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayScatterV3 2020-12-10 21:12:31.519004: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayV3 2020-12-10 21:12:31.519201: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayV3 2020-12-10 21:12:31.519404: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayV3 2020-12-10 21:12:31.519606: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayV3 2020-12-10 21:12:31.519863: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: Enter 2020-12-10 21:12:31.520066: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: Enter

    ..........

    2020-12-10 21:12:31.887365: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayGatherV3 2020-12-10 21:12:34.795091: I tensorflow/lite/toco/graph_transformations/graph_transformations.cc:39] Before Removing unused ops: 8696 operators, 17295 arrays (0 quantized) 2020-12-10 21:12:37.623228: I tensorflow/lite/toco/graph_transformations/graph_transformations.cc:39] After Removing unused ops pass 1: 8662 operators, 17241 arrays (0 quantized) 2020-12-10 21:12:41.875052: I tensorflow/lite/toco/graph_transformations/graph_transformations.cc:39] Before general graph transformations: 8662 operators, 17241 arrays (0 quantized) 2020-12-10 21:12:42.990606: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_1/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.114359: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_2/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.249639: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_3/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.371587: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_4/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.490386: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_5/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.610138: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_6/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.729914: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_7/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.845808: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_8/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.961710: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_9/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.079812: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_10/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.198136: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_11/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.316202: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_12/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.430921: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_13/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.545152: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_14/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.657000: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_15/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.767624: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_16/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.878775: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_17/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.987341: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_18/Conv2d_1x1/Conv2D". 2020-12-10 21:12:45.098353: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_19/Conv2d_1x1/Conv2D". 2020-12-10 21:12:45.207493: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_20/Conv2d_1x1/Conv2D". 2020-12-10 21:12:45.215230: F tensorflow/lite/toco/graph_transformations/resolve_constant_slice.cc:59] Check failed: dim_size >= 1 (0 vs. 1)

    Release v4.1

    Hey! Im trying to use the current release (v4.1)

    When running the data_management/databases/classification/make_classificiation_dataset.py it tries to locate a create_tfrecords_v3 which is seems by the comments to be a script to create tfrecords when using python 3 or higher.

    However, this create_tfrecords_v3 does not seem to exist.

    I am using the environment-detector.yml conda setup.

    Is the batch processing API and the real-time API running the same classifiers ?

    Looking into classifying some thermal images to filter out empty images and roughly label when there is are different objects in the pictures. Will process around 3000 images per day and need to label 3-4 different categories - empty , animal, human

    Wondering if the live web API or the batch processing would be a better option?
    The readme says that the web API runs species classifiers so wondering if they are the same for the batch processing

    Also do you use some public discord or slack?

    Tutorial is out of date

    From the main README.md, in section classification (https://github.com/microsoft/CameraTraps#classification),
    it states "here's a tutorial on training your own classifier using our detector and our training pipeline." and points to the archived TUTORIAL.md (https://github.com/microsoft/CameraTraps/blob/master/archive/classification_marcel/TUTORIAL.md).

    However, the tutorial seems to be far out of sync with the readme in https://github.com/microsoft/CameraTraps/blob/master/classification/README.md.

    I am still unsure how to run the training with my dataset with the current master branch.

    multi-GPU inference

    Is there an easy way to use multiple GPUs for inference for the MegaDetector? That could speed things up quite a bit for large datasets.

    tflite version

    Hi, that you for all the work done on this model and for sharing it with freely. I am wanting to use it in a real-time situation i.e. to spot monkeys in my vegetable patch. I was hoping to get it working on a rasp pi 4 with Coral Edge TPU. The Coral can only use tensorflow lite and so I paid someone to try convert the MegaDetector into a version that could run on the Coral TPU but that was unfortunately unsuccessful. My question is where you think there is any change or releasing a version that might run on a pi / Coral TPU / tflite?

    File "path/run_tf_detector.py", line 54, in <module> from ct_utils import truncate_float ModuleNotFoundError: No module named 'ct_utils'

    Hi, I'm a wildlife researcher who is very new to scripting/command line interfaces, so I'm sorry if this seems simple!

    I'm kind of confused, I was trying to run a quick demo of the megadetector_v3 on some camera trap imagery but can't seem to get the run_tf_detector.py script working due to 'ct_utils' in the root folder not being successfully referenced. I've made sure that my working directory is the root file of the git clone (CameraTraps) and I've tried running it in both 'cameratraps' and 'cameratraps-detector' environments. I've tried moving ct_utils into the detector sub-folder and vice-versa with no success.

    Any advice on how to get the ct_utils script referenced when running the run_tf_detector.py script?

    I'm following the directions found on: https://github.com/microsoft/CameraTraps/blob/master/megadetector.md

    Thanks!

    pretrained weights for megadetector v5?

    Thanks for continuing to maintain this awesome set of tools! My question is about where to find the latest release. I see megadetector v5 referenced here: https://github.com/microsoft/CameraTraps/blob/main/detection/README.md

    But the latest release is for 4.1 https://github.com/microsoft/CameraTraps/releases/tag/v4.1

    Are the weights available somewhere else other than the release page? We might be using Megadetector soon in addition to https://github.com/microsoft/SpeciesClassification to detect people, mountain lions, foxes, coyotes, and deer and I was looking to use the most up to date models published here.

    Detection Issues

    Dear authors of MegaDetector,

    I would be very grateful, if you could help me and my colleagues with a very persistent issue:
    We applied MegaDetector (run_tf_detector_batch.py) for a large amount of camera trap imagery. Yet we noticed that for many cases, the locations of the individual bounding boxes, stored in the resulting JSON-file, do not represent the animals’ actual position within the image. Thus, we ran the test version (run_tf_detector.py) for a collection of images. Yet the bounding boxes in the resulting β€œ..._detections”-images just matched perfectly with the animal’s location. Consequently, we performed the batch-version on folders only containing one of these test images and the resulting JSON-files contained the exact same values as after the first run while still not matching the animals position.
    We performed these steps on several different PCs (computers and laptops) as well as with different operating systems (Ubuntu and Windows). However, the issue persists.

    We would be extremely grateful for your advice and help.

    Best regards,
    Tobias Pelchen (e-mail: [email protected])

    detection issue.pdf

    Successful run of visualize_detector_output.py outputs empty file "anno_"

    After successfully running the run_tf_detector_batch.py on a test-subset of 10 images I'm getting stuck after running visualize_detector_output.py.

    I'm able to successfully run visualize_detector_output.py as referenced by this log:

    C:\git\cameratraps>python visualization\visualize_detector_output.py G:\Datasets\Test\test.json G:\Datasets\Annotations.jpg -i G:\Datasets\Test\test.json

    Options to the script: Namespace(confidence=0.8, detector_output_path='G:\Datasets\Test\test.json', images_dir='G:\Datasets\Test\test.json', out_dir='G:\Datasets\Annotations\.jpg', output_image_width=700, random_seed=None, sample=-1, sas_url=None)
    detection_categories provided
    Detector output file contains 10 entries.
    Starting to annotate the images...
    100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 10/10 [00:00<00:00, 17.83it/s]
    Rendered detection results on 10 images, saved to G:\Datasets\Annotations.jpg.

    But the output of the script is an empty file "anno_G" with no discernible extension.

    Where have I gone wrong!

    make_classification_dataset with custom train/test split

    Thanks for making this great repo!

    I love that make_classification_dataset.py allows you to crop images using the megadetector and create the tfrecords at the same time. One thing that has not been clear to me is how to specify a specific train/test split rather than having the script automatically split across locations.

    Is there a way to do this?

    MegaDetector crashes on VM instance

    I'm trying to run the MegaDetector over some images on a GCP VM instance with a Tesla T4 GPU. I created a conda virtual environment using the environment-detector.yml file with the tensorflow-gpu version set to 1.13.1, and I am trying to run the run_tf_detector_batch.py script as instructed in the MegaDetector page. However, the process crashes while processing the first image (below is a screenshot of my terminal window when that happens):

    Screen Shot 2021-03-25 at 1 34 27 AM

    I've tried to trace the program but haven't been able to figure out what the issue is. Any idea why this might be happening and how to solve it? Thanks!

    [building a potential Edge AI solution] As an ecologist, what are you looking for to extract from camera traps?

    Hey all,

    Not sure if it falls under an issue (I'm happy to post or x-post it somewhere more suitable). This repo may be the best place to ask some questions about what's needed to be integrated in camera traps, since individuals with technical backgrounds, with some understanding on research needs, are likely to stumble upon the repo.

    I'm an AI for Earth grantee (agritech and their integration within smart cities). I learned about camera traps during the AI for Earth Summit, and coincidentally, as a part of an online competition (Microsoft's Discover AI Upskilling Journey) where organizers have asked participants to look into helping out with camera traps.

    However, what is not particularly clear is, what exactly are ecologists looking to extract out of camera trap images?
    Animals (detection and classifying which animal)?
    Other objects related to poaching (people, cars, trucks)?
    Other features (weather conditions, time stamps of when detection happens)?

    Understanding what needs to be extracted can help us (researchers and hobbyists) understand what needs to be developed, as well as tailor the hardware and integration to better fit what is needed.

    Overview of my project:

    The goal of this project is to highlight how effective Edge AI can be in eliminating/filtering out a significant number of inter-steps between data collection and obtaining classification and detection results, potentially along with other data/features that need to be collected or extracted.

    Presentation1
    Picture legend:
    green arrows -> working
    red arrows -> not working as intended
    blue arrows -> have not tried it yet int this project, but tested out and working in other projects

    More information:
    The project aims to tie-in features from machine learning and internet of things with hardware devices, similar in some aspects to what is currently used on the ground, in order to reduce the overall time spent ecologists spend on classifications, collecting data, etc... As someone who has never interacted with ecologists, it's difficult to understand what is currently needed for them to better allocate their time and focus on research aspects, rather than countless hours on manual data labelling.

    Model: I trained a custom object detection and classification (Yolov4) model for 17 classes, which are all the classes of animals that I was able to manually make out in the MegaDetector demo video. The classes are Armadillo, Bear, Bird, Bull, [Car], Cat, Cattle, Deer, Dog, Fox, Monkey, [Person], Pig, Raccoon, Sheep, Tiger, [Truck]. I added a few classes in [brackets], just in case someone would find the model useful as is.

    The images used for training are obtained from Open Images Dataset V6.

    Then, I used the model to run object detection and classification on some test images (the sheep image in the overview is one of the test images used) and the MegaDetector demo video.

    Testing images (after ~5hrs training):
    Yolov4 TFLite (TensorFlow Lite model, converted from yolov4)
    yolov4 TFLite

    Yolov4
    yolov4

    Video:
    Running the yolov4 model on the MegaDetector demo video (less than 2 minutes for the model to detect and classify the video).

    It seems to be pretty ok for some parts, needs more training for other parts.
    Some examples for detection/classification:
    Screen Shot 2020-06-25 at 10 56 42 AM
    Screen Shot 2020-06-25 at 10 56 16 AM
    Screen Shot 2020-06-25 at 10 55 52 AM
    Screen Shot 2020-06-25 at 10 58 01 AM

    Some examples for bad detection/classification (model needs more training?):
    Screen Shot 2020-06-25 at 10 59 07 AM
    Screen Shot 2020-06-25 at 10 57 10 AM

    Extracting more information from images

    I was reading that ecologists also look at camera trap images to extract more information about the ecosystem, such as weather conditions.

    My hand-waving approach was to use Custom Vision to train weather data. I stumbled upon a Multi-class Weather Dataset for Image Classification, which contains 4 classes of weather conditions. The dataset was used for training.

    Screen Shot 2020-06-25 at 11 18 21 AM

    Screen Shot 2020-06-25 at 11 19 00 AM

    However, testing the model on some images, the model does not seem to be the best for camera-traps. Even though some classifications were on point, the accuracy is really low.

    Screen Shot 2020-06-25 at 11 19 22 AM

    Screen Shot 2020-06-25 at 11 25 14 AM

    Another hand-waving approach to extract weather data from camera-trap images (haven't implemented this yet):

    It seems that some images display temperature data in each image. Another approach is to extract temperature information as well as time/date stamps from images (OCR). Then use historical weather datasets (such as openweather API) to fetch other weather features such as humidity, wind direction/speed, weather conditions (rainy, cloudy, partially sunny, sunny), etc..

    Raspberry pi as an Edge AI Device

    Screen Shot 2020-06-22 at 5 38 48 PM

    As implied earlier, the main reason to convert the Yolov4 model to a TensorFlow Lite model is for it to be run on the raspberry pi. This means that as an image or video is taken, object detection and classification happens on the device itself, where the results can be either uploaded directly to the internet or saved on an SD card.

    As the weather information is relatively more challenging to extract directly from images and videos, integrating relatively cheap sensors may provide a more custom overview of the ecosystem. For example, it is possible to add sensors to collect data about emissions such as COx, NOx, NH3, VOCs as well as air quality. It would also allow for more customizability when it comes to choosing a camera that may be a better fit for custom applications, as well as detection sensors (in this example, pyroelectric/ infrared sensor in addition to an ultrasonic sensor).

    cannot import name 'TFDetector' from 'detection.run_detector

    using colab

    Traceback (most recent call last):
      File "run_tf_detector_batch.py", line 66, in <module>
        from detection.run_detector import ImagePathUtils, TFDetector
    ImportError: cannot import name 'TFDetector' from 'detection.run_detector' (/content/CameraTraps/detection/run_detector.py)
    

    Limit number of cores used

    I'm using the batch detector on a HPC and need to limit the number of cores used to the number requested. If I used the --cores the batch detector creates a pool of workers which uses a lot more memory and does not support logging or checkpointing. Is there a way to limit the number of cores used without creating a workerpool? I think it's related to set_inter_op_parallelism_threads, but I'm not sure how to set tensorflow config from run_tf_detector_batch.py.

    No module named 'detection'

    when i try to run the code on local pc i got this error
    Traceback (most recent call last): File "CameraTraps\detection/run_detector_batch.py", line 79, in <module> from detection.run_detector import ImagePathUtils, is_gpu_available, load_detector ModuleNotFoundError: No module named 'detection'

    Batch inference with classifier?

    What is the appropriate way to do batch inference using a trained classifier? The classifier documentation lists three methods:

    1. predict_image.py
    2. detect_and_predict_image.py
    3. generate_sample_predictions.py

    The first appears not to support batch inference. #2 and #3 do batch inference, but running each of them resulted in the entire data set attempting to be loaded into memory. What is the correct way to perform inference on a large number of images using this repo?

    Are the MDv5 pt files "normal" YOLOv5 model files?

    Are the MDv5 pt files "normal" YOLOv5/PyTorch model files? Aka can I use my existing scripts for running YOLOv5 models to run MDv5? or is there something custom about the pt files that I need to be aware of?

    Method to look up request IDs from the API

    The API rocks! It would be handy to have a method to list a table (or JSON) of all the request ids with date submitted, date completed, job name, number of images, and brief status (e.g. failed, completed, processing). Maybe be able to limit to a date range or job name matcher?

    Apple M1 Support

    Hi All - very excited about this release!

    I know this is probably deep down in the dependencies but wanted to raise given that M1 chips are becoming more common and that Mac instructions are given in the README.

    The following error occurred running run_detector.py

    Intel MKL FATAL ERROR: This system does not meet the minimum requirements for use of the Intel(R) Math Kernel Library.
    The processor must support the Intel(R) Supplemental Streaming SIMD Extensions 3 (Intel(R) SSSE3) instructions.
    The processor must support the Intel(R) Streaming SIMD Extensions 4.2 (Intel(R) SSE4.2) instructions.
    The processor must support the Intel(R) Advanced Vector Extensions (Intel(R) AVX) instructions.
    

    Program seems to hit a dead-lock when using multiple cores

    Hello,
    As I wanted to increase the performance of the program run_tf_detector_batch.py, I used the option --ncores you provided. I tried it with 2 or 4 cores but each time, the program is stuck. Here are the last shell outputs I get:

    Creating pool with 2 cores
    TFDetector: Loading graph...
    TFDetector: Loading graph...
    TFDetector: Detection graph loaded.
    TFDetector: Detection graph loaded.
    Loaded model (batch level) in 6.78 seconds
    Loaded model (batch level) in 6.78 seconds
    Processing image /home/...
    Processing image /home/...
    2020-08-17 09:43:40.366492: W tensorflow/core/framework/cpu_allocator_impl.cc:81] Allocation of 20054016 exceeds 10% of system memory.
    2020-08-17 09:43:40.366493: W tensorflow/core/framework/cpu_allocator_impl.cc:81] Allocation of 20054016 exceeds 10% of system memory.
    

    When I don't use the ncores option, I process 30 images in ~5 minutes. I've waited 10 minutes with the option, but it didn't process even 1 image.

    Do you know what could cause this behaviour ?

    Error when image file is corrupted

    Some images, especially from Bushnell cameras, are corrupted (there is missing data). An example is: Snapshot Serengeti\S1\I03\I03_R1\S1_I03_R1_PICT3967.JPG
    The run_tf_detector_batch.py script crashes on those images. The only way to process the images is to open them in an image editor and save them again, replacing the bad data with some background value.

    Adding the following code to run_tf_detector_batch.py will replace the bad data automatically and allows the batch detector to run over all images without an error;

    from PIL import ImageFile
    ImageFile.LOAD_TRUNCATED_IMAGES = True

    Confused about language for how to set-up a local batch process with the pre-trained Megadetector_v3

    Hi!

    I am confused about how to initiate the batch processing functionality of the megadetector. I've now successfully called the single image detection script, but am not completely understanding what phrasing to use when designating a multiple images to be sifted through the run_tf_detector_batch.py

    I'm new to all of this, I've learned what a json file is and see that designating one is necessary for the batch process. Would someone be able to point me in the right direction for how to phrase the command to designate multiple images to the batch process script?

    Thanks for your time!

    edit: Correction, I now understand the purpose of the json file and it's role with using the visualization tool. My main confusion is how do you communicate to the script that an entire directory of images is to be classified as opposed to just one.

    is this error means my gpu memory_size is small

    is this error means my gpu memory_size is small
    [00:00<?, ?it/s]2022-04-25 01:31:03.333175: W tensorflow/core/grappler/costs/op_level_cost_estimator.cc:690] Error in PredictCost() for the op: op: "CropAndResize" attr { key: "T" value { type: DT_FLOAT } } attr { key: "extrapolation_value" value { f: 0 } } attr { key: "method" value { s: "bilinear" } } inputs { dtype: DT_FLOAT shape { dim { size: -1204 } dim { size: -1205 } dim { size: -1206 } dim { size: 1088 } } } inputs { dtype: DT_FLOAT shape { dim { size: -25 } dim { size: 4 } } } inputs { dtype: DT_INT32 shape { dim { size: -25 } } } inputs { dtype: DT_INT32 shape { dim { size: 2 } } value { dtype: DT_INT32 tensor_shape { dim { size: 2 } } int_val: 17 } } device { type: "GPU" vendor: "NVIDIA" model: "NVIDIA GeForce GTX 1050 Ti" frequency: 1392 num_cores: 6 environment { key: "architecture" value: "6.1" } environment { key: "cuda" value: "11020" } environment { key: "cudnn" value: "8100" } num_registers: 65536 l1_cache_size: 24576 l2_cache_size: 1048576 shared_memory_size_per_multiprocessor: 98304 memory_size: 2917584078 bandwidth: 112128000 } outputs { dtype: DT_FLOAT shape { dim { size: -25 } dim { size: 17 } dim { size: 17 } dim { size: 1088 } } }

    Script to convert Mega output JSON to COCO?

    Hi, we are trying to do some transfer learning with our Megadetector output JSON (retraining Mobilenet in TF2) and just realized the JSON output format from Megadetector is not in COCO format. Do you know if there is a handy script laying around somewhere that converts the images..detections into annotations and fixes the bounding box coordinates and any other differences i haven't learned about yet! Thank you

    Error installing tensorflow 1.13.1

    I couldn't install tensorflow using the readme instructions:

    $ pip install tensorflow==1.13.1
    ERROR: Could not find a version that satisfies the requirement tensorflow==1.13.1 (from versions: 2.2.0rc1, 2.2.0rc2, 2.2.0rc3, 2.2.0rc4, 2.2.0, 2.2.1, 2.2.2, 2.3.0rc0, 2.3.0rc1, 2.3.0rc2, 2.3.0, 2.3.1, 2.3.2, 2.4.0rc0, 2.4.0rc1, 2.4.0rc2, 2.4.0rc3, 2.4.0rc4, 2.4.0)
    ERROR: No matching distribution found for tensorflow==1.13.1
    

    But after some googling this method worked for me:

    python -m pip install --upgrade https://storage.googleapis.com/tensorflow/mac/cpu/tensorflow-1.13.1-py3-none-any.whl
    

    ModuleNotFoundError: No module named 'ct_utils'

    Hello
    I'm new to using Tensorflow and Python, and have possibly made a silly mistake, but I keep getting this error when I try and run the run_tf_detector.py script:

    python CameraTraps/detection/run_tf_detector.py md_v4.1.0.pb --image_file IMG0027.JPG
    Traceback (most recent call last):
    File "CameraTraps/detection/run_tf_detector.py", line 54, in
    from ct_utils import truncate_float
    ModuleNotFoundError: No module named 'ct_utils'

    Setup is:
    Ubuntu 18.04
    python --version
    Python 3.6.8

    echo $PYTHONPATH
    ~/ml/microsoft_mega/CameraTraps;~/ml/microsoft_mega/ai4eutils
    which was created via
    export PYTHONPATH="~/ml/microsoft_mega/CameraTraps;~/ml/microsoft_mega/ai4eutils"
    so it is possible I've made a mistake here as the instructions are for Windows.

    I'm not using Anaconda, but a Python virtual environment. Tensorflow and other pip libraries all installed without problems.
    Many thanks, and apologies if I've overlooked something simple.
    Roy

    MegaDetector: broken data stream when reading image file / cannot join current thread

    Before I start digging into it further, has anyone come across the following problem? I've run the detector twice and got the same result... thinking along the lines of corrupt file or faulty disk?

    Potentially a duplicate of #94 but doesn't contain any logs so unsure.

    $ PYTHONPATH=$PYTHONPATH:$(pwd) python3 detection/run_tf_detector_batch.py --recursive --forceCpu --checkpointFrequency 1000
     --outputRelativeFilenames ./detection/megadetector_v3.pb ../borneo-dataset/release/0.5/SAFE/SAFE_2/ ../Borneo-0.5-SAFE2.txt
                                                                                                                 
    tensorflow tf version: 1.14.0                                     
    2019-10-03 12:28:29.084524: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: FMA
    2019-10-03 12:28:29.381379: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2400065000 Hz
    2019-10-03 12:28:29.386639: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3d43f60 executing computations on platform Host. Devices:
    2019-10-03 12:28:29.386727: I tensorflow/compiler/xla/service/service.cc:175]   StreamExecutor device (0): <undefined>, <undefined>
    tf_detector.py, tf.test.is_gpu_available: False                                               
    WARNING: Logging before flag parsing goes to stderr.                            
    W1003 12:28:29.404660 140666144171840 deprecation_wrapper.py:119] From detection/run_tf_detector_batch.py:51: The name tf.logging.set_verbosity is deprecated. Please us
    e tf.compat.v1.logging.set_verbosity instead.                                                                 
                    
    W1003 12:28:29.405052 140666144171840 deprecation_wrapper.py:119] From detection/run_tf_detector_batch.py:51: The name tf.logging.ERROR is deprecated. Please use tf.com
    pat.v1.logging.ERROR instead. 
                                                                                                                          
    Running detector on 57170 images                       
    Loading model...                                                                                             
    tf_detector.py: Loading graph...                                                
    tf_detector.py: Detection graph loaded.                     
    Loaded model in 15.1 seconds                                                                                  
    Running detector...                     
    0it [00:00, ?it/s]2019-10-03 12:28:55.971887: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because env
    var TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set.  If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU.  To confirm that XL
    A is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile.
    var TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set.  If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU.  To confirm [0/498]
    A is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile.
    2019-10-03 12:29:01.395446: W tensorflow/core/framework/allocator.cc:107] Allocation of 377318400 exceeds 10% of system memory.
    2019-10-03 12:29:01.487368: W tensorflow/core/framework/allocator.cc:107] Allocation of 377318400 exceeds 10% of system memory.
    2019-10-03 12:29:01.982219: W tensorflow/core/framework/allocator.cc:107] Allocation of 99878400 exceeds 10% of system memory.
    2019-10-03 12:29:02.555869: W tensorflow/core/framework/allocator.cc:107] Allocation of 159744000 exceeds 10% of system memory.
    2019-10-03 12:29:02.715443: W tensorflow/core/framework/allocator.cc:107] Allocation of 159744000 exceeds 10% of system memory.
    Checkpointing 1 images to /tmp/detector_batch/tmpud3njeui......done
    1000it [2:11:24,  7.87s/it]Checkpointing 1001 images to /tmp/detector_batch/tmp6wgeh416......done                                                                       
    2000it [4:23:28,  7.90s/it]Checkpointing 2001 images to /tmp/detector_batch/tmp_6tnl91e......done                                                                       
    3000it [6:36:44,  8.07s/it]Checkpointing 3001 images to /tmp/detector_batch/tmpnwm51sek......done                                                       
    3344it [7:23:24,  8.08s/it]Traceback (most recent call last):                                                                  
      File "detection/run_tf_detector_batch.py", line 559, in <module>                                                             
        main()                                                                                                                    
      File "detection/run_tf_detector_batch.py", line 554, in main                                                                 
        load_and_run_detector(options)                                                                                             
      File "detection/run_tf_detector_batch.py", line 437, in load_and_run_detector
        boxes,scores,classes,imageFileNames = generate_detections(detector,imageFileNames,options)   
      File "detection/run_tf_detector_batch.py", line 167, in generate_detections                    
        imageNP = PIL.Image.open(image).convert("RGB"); imageNP = np.array(imageNP)                  
      File "/home/bencevans/.local/lib/python3.6/site-packages/PIL/Image.py", line 912, in convert
        self.load()                                                   
      File "/home/bencevans/.local/lib/python3.6/site-packages/PIL/ImageFile.py", line 261, in load
        raise_ioerror(err_code)                                   
      File "/home/bencevans/.local/lib/python3.6/site-packages/PIL/ImageFile.py", line 58, in raise_ioerror
        raise IOError(message + " when reading image file")                        
    OSError: broken data stream when reading image file                                           
    Exception ignored in: <bound method tqdm.__del__ of 3344it [7:23:25,  8.08s/it]>
    Traceback (most recent call last):                                             
      File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_tqdm.py", line 931, in __del__
        self.close()
      File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_tqdm.py", line 1133, in close 
        self._decr_instances(self)
      File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_tqdm.py", line 496, in _decr_instances
        cls.monitor.exit()                                 
      File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_monitor.py", line 52, in exit
        self.join()                                                                 
      File "/usr/lib/python3.6/threading.py", line 1053, in join
        raise RuntimeError("cannot join current thread")                                                          
    RuntimeError: cannot join current thread
    

    Failed to load the native TensorFlow runtime.

    I'm a little familiar with Python and Anaconda, and have followed the steps for installation on a windows machine. However, I get the error "Failed to load the native TensorFlow runtime" when I try to run_tf_detector.py - can anyone point me to where I am going wrong?

    Crop MegaDetector json

    Hi,
    I've just started using megadetector and I was wondering if you could recommend any good code to crop my images using the boxes identified?
    Thanks!

    Using the estimated models outside of MegaDetector

    Hi,

    Could it be possible to use the estimated models (I mean, the model weights) outside of MegaDetector, for those of us who have their own codes already running?
    I would like to reuse your estimated models in my codes implemented in Keras. Do you think it could be possible?

    I think this could be very interesting for many people...

    Training megadetector on my own images

    Hello,
    Thank you so much for this great program!

    I have thousands of images from trap cameras, and while megadetector works wonderfully on some landscapes, it fails in other (it detects trees and stones as animals).

    What is a useful way to train megadetector on my own set of images?

    Cheers,
    Bar

    tensorflow-gpu version

    Hello, just a quick question about the supported tensorflow-gpu version. I am having installation issues with version 1.13.1 and was wondering if there is a more recent version that works, or if it must necessarily be 1.13.1. Thanks!

    Model doesn't learn to classify

    Hey guys!

    So, I've been trying to work with this amazing repo you guys put out there but, even though I can run the scripts and execute training, the model simply doesn't seem to be learning how to classify. My goal is to train a classifier on my own species.

    I will try to describe the path I made to get where I am now.
    I'm on Ubuntu 18.04, tensorflow-gpu=1.15 and have installed all necessary dependencies to run your repository.
    I mainly followed this to try and achieve what I needed: https://github.com/microsoft/CameraTraps/blob/master/classification/TUTORIAL.md

    First, I started with a VOC format dataset (consists of images + xml with the detections). Because here we need to use the COCO json format, I used the following script to make such changes:
    https://github.com/Tony607/voc2coco/blob/master/voc2coco.py
    And also, here's an example of an original XML file.
    image
    Now I bring up the question: the "path" field is not correct, as this file is not in the original place anymore. However, it doesn't seem to affect anything, because I can still generate the COCO json. Here's an example extracted from said json:
    image
    Which seems ok, since it got the file name right (this is the name it'll search when I point to the images folder, right?) and as I understand COCO json doesn't use xmax and ymax but rather width and height, right as well? The category id also matches the one expected.
    Alright, the next step was generating the tf records.
    I started trying by using your script, but it seemed to be trying to import a file that did not exist, so instead I went in and used this one (from TF OD API):
    https://github.com/tensorflow/models/blob/master/research/object_detection/dataset_tools/create_coco_tf_record.py
    As for the test and validation folders, I pointed it to the same and simply ignored the validation files (since here only train and test matters). Everything went well and I got my folder full of tf records (I just had to rename them because the training script here searches for "train-" and "test-" so I add these to the beginning of my file name.
    So, with everything in hand, I went to the training part.
    I had already the tf records and model checkpoint (inception v4), just needed to prepare some files for training. First copy and modify the dataset python script. I changed the "SPLIT_TO_SIZE" variable to: SPLITS_TO_SIZES = {'train': 7624, 'test': 1885} and "_NUM_CLASSES = 17". Furthermore, I added my dataset to "dataset_factory.py". Finally, I modify the .sh script to adapt and create the one for my own dataset.
    DATASET_DIR points to the folder containing the tf records. TRAIN_DIR to the output path (log/date...). CHECKPOINT_PATH to where I had my inception model checkpoint. And MODEL_NAME was still "inception_v4". For better visualization:
    image

    Finally, I executed the script by typing: bash ../training_scripts/train_deep_inception_v4.sh
    It trained reeeeally slowly, with the loss decreasing .0001 per 300~400 steps. It seemed ok though cause it was actually going down. I left the model training overnight, unfortunately it trained only for a few hours.
    However, when I woke up and came to check the model, it had stopped immediately after an evaluation step which outputed accuracy and recall equal to 1, which I knew wasn't right. I then exported the graph to see an actual inference and this is what I got:
    image
    So, the detection is working well, since I use the megadetector model you guys provide, however, classification absolutely sucks. What it is doing is simply labeling all animals as the first label it sees on my "label.txt" file. If I change which class comes first, classification changes as well. So, it doesn't like the model is learning to classify anything, and I simply can't find a reason why.
    I'm working on this problem for quite a while now, have tried to use TF OD API directly but with no success as well.

    Any tips? Directions? Questions?
    Please any help is welcome!

    Colab notebook: annotated images not available in [Visualization_Folder]

    Thanks for sharing the colab notebook. As a newbie in this, it's very helpful to be able to test your script before investing time into deploying it at full scale on all the camera traps photos in our project (nest success and invasive predator in an endangered Caribbean seabird).

    I would like to share with collaborators the annotated images resulting from the script but the Google Drive folder used as [Visualization_Folder] does not contain any file. Also, Colab's runtime keeps disconnecting and cannot make through the annotation/display of 25 samples photos, hence the need to be able to save those images that have gone through the process.
    Thus, my question may be very basic: how to save (or access) the images that are annotated and displayed in the last step (cell below)?

    Here are the steps of interest:

    images_dir = '/content/drive/My Drive/ctrap'
    
    # choose a location for the output JSON file
    output_file_path = '/content/drive/My Drive/ctrap/detector_2020-09-28.json'
    

    Here we use the visualize_detector_output.py in the visualization folder of the Camera Traps repo to see the output of the MegaDetector visualized on our images. It will save images annotated with the results (original images will not be modified) to the [Visualization_Folder] you specify here.

    visualization_dir = '/content/My Drive/ctrap/visualize_2020-09-28'  # pick a location for annotated images
    
    !python visualize_detector_output.py "$output_file_path" "$visualization_dir" --confidence 0.8 --images_dir "$images_dir"
    
    import os
    from PIL import Image
    
    for viz_file_name in os.listdir(visualization_dir):
      print(viz_file_name)
      im = Image.open(os.path.join(visualization_dir, viz_file_name))
      display(im)  # display() is an iPython method that comes with the notebook
    

    I understand that the images are supposed to be saved in the [Visualization_Folder] but this folder is empty (see edited screeenshots below - note that the visualize_2020-09-28 folder is open).
    Untitled

    CameraTraps/api/batch_processing/api_core/ readme not specific enough

    I am attempting to set up the API in my own subscription and I have never used Azure Machine Learning workspace before. In the CameraTraps/api/batch_processing/api_core/README.md it states: to set up a new "AML workspace" but there are two resource types on azure that have similar names to this and it is not clear which I should use. The two names are:

    • Azure Machine Learning service workspace
    • Azure Machine Learning Studio workspace

    After reading the descriptions, I assume it is the Azure Machine Learning service workspace but I would like clarification.

    ModuleNotFoundError: No module named 'visualization'

    Hi there,
    I've probably done something stupid, but can't figure out what. I've run this on two different computers (but within the same university, so maybe there are some rights issues) and get to the same error.

    Running Windows 10, Anaconda and Git installed fine. I ran the following lines of code without error:

    mkdir c:\git
    cd c:\git
    git clone https://github.com/Microsoft/cameratraps
    git clone https://github.com/Microsoft/ai4eutils
    cd c:\git\cameratraps
    conda env create --file environment-detector.yml
    conda activate cameratraps-detector
    set PYTHONPATH=%PYTHONPATH%;c:\git\cameratraps;c:\git\ai4eutils;c:\git\yolov5

    The git folder is in C:\git and the megadetector file is in C:\megadetector. (i.e. using the suggested defaults).

    But when I run the following code, I get the error below.

    (cameratraps-detector) PS C:\git\CameraTraps> python detection\run_detector.py "c:\megadetector\md_v5a.0.0.pt" --image_file ".\test_images\test_images\caltech_camera_traps_58a022a4-23d2-11e8-a6a3-ec086b02610b.jpg" --threshold 0.2

    Traceback (most recent call last):
    File "detection\run_detector.py", line 53, in
    import visualization.visualization_utils as viz_utils
    ModuleNotFoundError: No module named 'visualization'

    I did try do some trouble shooting, and two websites suggested that I run the following code:
    pip install tf-keras-vis
    pip install keras-vis

    I did this. But no change.

    What I am I missing? Thank you in advance for your time, and please let me know if there is any additional information that you may need.

    eliminating repeated detection

    Hi Dan,
    thank you for your quick reply! (Issue #290 )

    after running the pipeline:

    • find_repeat_detections.py
    • manually removing false posittives
    • remove_repeat_detections.py
      I end up with a json file.
      when running this json file in the script postprocess_batch_results.py, I get the following error:
      File "load_api_results.py", line 93, in load_api_results
      assert s in detection_results, 'Missing field {} in detection results'.format(s)
      AssertionError: Missing field info in detection results

    Looking at the json file, it doesn't have the same fields as in the output json file of the run_detector_batch.py.

    Is there a way use the postprocess_batch_results.py on the json file of the repeat detections?

    Thanks

    `group` class?

    Hi

    Please could you clarify as to what the group class is supposed to represent? Group of animals, group of people, vehicles...?

    Thanks

    Ben

    Prediction results from locally loaded SavedModel significantly differ from TF Serving model

    Hi,

    I'm working with MegaDetector v3, 2019.05.30 and noticed that the prediction results for a given image significantly differ between the locally loaded SavedModel and the TFServing model -> where:

    locally loaded SavedModel = saved_model_normalized_megadetector_v3_tf19.tar.gz
    TFServing model = saved_model_megadetector_v3_tf19.zip

    The locally loaded quite underperforms vs. the TFServing model. Is this something you have seen before? Happy to share my code (am on TF2.1)

    Thanks & best regards,

    Mike

    How to use this with TF Serving?

    Hi, I'm a seasoned JS/web/full stack developer but fairly new to ML and almost no Python experience.

    I'm trying to setup an endpoint (similar to your API endpoint), but running on Google Cloud Infrastructure, where I can submit an image (or several for batching) and get json results back.

    From the documentation I understood that I can download this model (https://lilablobssc.blob.core.windows.net/models/camera_traps/megadetector/md_v4.1.0/md_v4.1.0_saved_model.zip), and exposing it up using TFServing.
    However I'm hitting some dead ends on how to input the image as an argument.

    I set the model up using tf version 1.15 on Google's Vertex AI Prediction service (https://cloud.google.com/vertex-ai/docs/general/import-model), and tried this for an input argument:

    {"instances": [{"b64": ""}]}

    But I get this error back:
    { "error": "Failed to process element: 0 of \'instances\' list. Error: Invalid argument: JSON Value: {\n \"b64\": \"\"\n} Type: Object is not of expected type: uint8" }

    I'm thinking that maybe the Vertex AI input parameters need to be formatted somehow to match what the model expects, but I can't find the documentation on what arguments the model expects when used with TF Serving?

    Any pointers is much appreciated! Am I even on the right path to try and setup the service using Vertex AI's services? Is there maybe a tutorial on how to use the model using TF serving?

    Recommend Projects

    • React photo React

      A declarative, efficient, and flexible JavaScript library for building user interfaces.

    • Vue.js photo Vue.js

      πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

    • Typescript photo Typescript

      TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

    • TensorFlow photo TensorFlow

      An Open Source Machine Learning Framework for Everyone

    • Django photo Django

      The Web framework for perfectionists with deadlines.

    • D3 photo D3

      Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

    Recommend Topics

    • javascript

      JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

    • web

      Some thing interesting about web. New door for the world.

    • server

      A server is a program made to process requests and deliver data to clients.

    • Machine learning

      Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

    • Game

      Some thing interesting about game, make everyone happy.

    Recommend Org

    • Facebook photo Facebook

      We are working to build community through open source technology. NB: members must have two-factor auth.

    • Microsoft photo Microsoft

      Open source projects and samples from Microsoft.

    • Google photo Google

      Google ❀️ Open Source for everyone.

    • D3 photo D3

      Data-Driven Documents codes.