DSMIL: Dual-stream multiple instance learning networks for tumor detection in Whole Slide Image
Updated Apr 29, 2024 - Python
Repository for a Master's thesis project investigating classification of 3D chest CT scans using a Vision Transformer.
Experimental removal / shuffling of layers in CLIP ViT + Text Transformer
Deep Fake Detection using Vision Transformer and Neural Network
Final project for the master's degree in Computer Science course "Advanced Machine Learning" (AML) at the University of Rome "La Sapienza" (A.Y. 2023-2024).
Like Golden Gate Claude, but with a CLIP Vision Transformer ~ feature activation manipulation fun!
This repo showcases the ENPM673: Perception for Autonomous Robots final project. SegFormer, a vision transformer (ViT) architecture, has been replicated to implement semantic segmentation. It was also deployed on a Raspberry Pi with a Pi camera setup to validate real-time performance.
Vision transformer and CNN implementations for image classification using PyTorch.
Code for the paper "Relating Implicit Bias and Adversarial Attacks through Intrinsic Dimension" [https://arxiv.org/abs/2305.15203] -- Now +CLIP!