Comments (3)
Hello,
Resizing the input is not allowed, because this method relies on the Block Artifact Grids
(BAGs) for tampering localization. When the input image is resized (enlarged or shrunk), its BAGs are destroyed, so the method does not work in that case.
However, by exploiting the BAGs, the model gains much finer-grained perception for detecting visually consistent tampering, which is important for tampered text detection in documents. The BAGs also give the model much better cross-domain generalization.
So this method sacrifices robustness to resizing in exchange for better detection and generalization.
It is also worth noting that an image that has been resized or never had BAGs (e.g. a smartphone screen capture) can easily be distinguished, in the Fourier frequency domain, from images that do have BAGs by a simple binary classification model.
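As a rough illustration of why this separation is easy, the sketch below measures 8-pixel periodicity directly instead of training a classifier. It is a hand-written heuristic, not the classifier mentioned above: the names `bag_score` and `has_bags`, the 8-pixel period (the usual JPEG block size), and the threshold of 5.0 are all assumptions chosen for illustration.

```python
import numpy as np


def bag_score(gray: np.ndarray, period: int = 8) -> float:
    """Spectral peak at the block period, relative to the median background.

    `gray` is a 2-D grayscale image. A large value suggests a regular grid of
    block boundaries every `period` columns (assumed 8, as in baseline JPEG).
    """
    gray = gray.astype(np.float64)
    # Mean absolute difference between neighbouring columns.
    # Block boundaries show up as a comb with one spike per `period` columns.
    profile = np.abs(np.diff(gray, axis=1)).mean(axis=0)
    # Trim to a multiple of the period so the comb falls on an exact FFT bin.
    n = (profile.size // period) * period
    if n < 2 * period:
        raise ValueError("image is too narrow for the chosen period")
    profile = profile[:n] - profile[:n].mean()
    spectrum = np.abs(np.fft.rfft(profile))
    peak_bin = n // period  # frequency of one cycle per `period` columns
    background = np.median(spectrum[1:]) + 1e-12
    return float(spectrum[peak_bin] / background)


def has_bags(gray: np.ndarray, threshold: float = 5.0) -> bool:
    """Hypothetical decision rule; the threshold is an untuned assumption."""
    return bag_score(gray) > threshold
```

A resized image loses the exact 8-pixel spacing, so the spike moves off that bin and the score should drop. In practice the threshold would need tuning on real data, and a trained classifier as suggested above is likely more reliable than this single feature.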
Therefore, in real-world applications, tampering localization can be done in a Mixture-of-Experts manner. First, use a classifier to decide whether the input image has BAGs. If it does, crop it into patches and feed the patches into a frequency-based model such as this one; if it does not, feed it into an ordinary pure-RGB model that is robust to resizing. This way the advantages of both model types are retained.
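The routing described above could be sketched roughly as follows. This is only an outline under stated assumptions: `localize`, `has_bags`, `frequency_model` and `rgb_model` are hypothetical callables standing in for the real classifier and models, and the 512-pixel patch size is an assumption. The sketch keeps the patch size a multiple of 8 so that crops stay aligned with the 8x8 block grid.

```python
from typing import Callable, Iterator, Tuple

import numpy as np

# Hypothetical interfaces: each model maps an image (H, W[, C]) to a mask (H, W).
Model = Callable[[np.ndarray], np.ndarray]
Gate = Callable[[np.ndarray], bool]


def iter_patches(height: int, width: int, patch: int) -> Iterator[Tuple[int, int, int, int]]:
    """Yield (top, left, bottom, right) boxes that tile the image without overlap."""
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            yield top, left, min(top + patch, height), min(left + patch, width)


def localize(
    image: np.ndarray,
    has_bags: Gate,
    frequency_model: Model,
    rgb_model: Model,
    patch: int = 512,  # assumed patch size, not taken from the thread
) -> np.ndarray:
    """Route the image to one of two experts and return a tampering mask."""
    if patch % 8 != 0:
        raise ValueError("patch must be a multiple of 8 to stay on the block grid")
    if not has_bags(image):
        # No usable block artifacts: fall back to the resize-robust RGB expert.
        return rgb_model(image)
    height, width = image.shape[:2]
    mask = np.zeros((height, width), dtype=np.float32)
    for top, left, bottom, right in iter_patches(height, width, patch):
        # Crops start at multiples of `patch`, hence at multiples of 8.
        mask[top:bottom, left:right] = frequency_model(image[top:bottom, left:right])
    return mask
```

A real frequency-based model would probably also need inputs beyond raw pixels, such as JPEG quantization information, which this sketch leaves out.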
In some B2B (to-business) scenarios, we can also ask users not to resize the image before uploading, and ask them to re-upload if the submitted image is detected as resized.
Regarding question 2: the performance has nothing to do with "clarity"; it depends entirely on whether BAGs are present.
from doctamper.
@RobotDouble Could you please help me understand how to test it on our own dataset?
@Nomiluks Please refer to L39~L137 of https://github.com/qcf-568/DocTamper/blob/main/models/tsroie/infer_sroie.py
and to this Colab notebook: https://colab.research.google.com/drive/1rWaSKy2Rsy5welyvj6FbzF01o2zv8ips?usp=sharing