Comments (11)
Yes, it works. The log-likelihood and norm computations are correct and identical in both implementations.
My mistake was that I thought num_tags in pytorch-crf included the pad, start, and stop tags.
Thank you very much. You have been very helpful with these issues.
from pytorch_neural_crf.
You mean testing the correctness against pytorch-crf?
Yes, the correctness of the implementation.
I tested it with a small sequence length of N=3: I basically hard-coded the transition and emission scores in the respective implementations and compared the forward labeled scores.
I can share a dummy sample notebook if you would like to take a look.
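A brute-force check along those lines can be sketched in plain PyTorch (this is an illustrative toy, not code from either library): for a short sequence, enumerate every tag path and compare the logsumexp of all path scores against the forward algorithm.

```python
import itertools
import torch

# Toy setup: random scores for N=3 timesteps and 2 tags, mirroring
# the hard-coded comparison described above.
num_tags = 2
seq_len = 3
torch.manual_seed(0)
emissions = torch.randn(seq_len, num_tags)     # emission score per (step, tag)
transitions = torch.randn(num_tags, num_tags)  # transition score (from, to)

def brute_force_log_z(emissions, transitions):
    """Enumerate every tag sequence and logsumexp their total scores."""
    scores = []
    for seq in itertools.product(range(num_tags), repeat=seq_len):
        s = emissions[0, seq[0]]
        for t in range(1, seq_len):
            s = s + transitions[seq[t - 1], seq[t]] + emissions[t, seq[t]]
        scores.append(s)
    return torch.logsumexp(torch.stack(scores), dim=0)

def forward_log_z(emissions, transitions):
    """Standard forward algorithm in log space."""
    alpha = emissions[0]  # (num_tags,)
    for t in range(1, seq_len):
        # alpha_new[j] = logsumexp_i(alpha[i] + trans[i, j]) + emit[t, j]
        alpha = torch.logsumexp(alpha.unsqueeze(1) + transitions, dim=0) + emissions[t]
    return torch.logsumexp(alpha, dim=0)

assert torch.allclose(brute_force_log_z(emissions, transitions),
                      forward_log_z(emissions, transitions), atol=1e-5)
```

With N=3 and 2 tags there are only 8 paths, so the enumeration stays cheap while still exercising the same recursion both CRF implementations use.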
Sure, no problem. That would be appreciated.
I'm not sure how you tried it, but I tried the example here (https://pytorch-crf.readthedocs.io/en/stable/), and everything looks fine.
import torch
from torchcrf import CRF
num_tags = 5 # number of tags is 5
model = CRF(num_tags)
seq_length = 3
batch_size = 2
emissions = torch.randn(seq_length, batch_size, num_tags)
tags = torch.tensor([[0, 1], [2, 4], [3, 1]], dtype=torch.long)
print(model(emissions, tags))
from src.model.module import LinearCRF
from src.data.data_utils import START_TAG, STOP_TAG, PAD
labels = ['A', 'B', 'C', 'D', 'E', START_TAG, STOP_TAG, PAD]
label2idx = {label: idx for idx, label in enumerate(labels)}
mycrf = LinearCRF(label_size=8, label2idx=label2idx, add_iobes_constraint=False, idx2labels=labels)
## pad the emissions with a very negative score for the 3 extra special tags (START, STOP, PAD)
converted_emissions = torch.cat([emissions.transpose(0, 1), torch.full((batch_size, 3, 3), -10000.0)], dim=-1)
converted_tags = tags.transpose(0, 1)
mask = torch.tensor([[1, 1, 1], [1, 1, 1]]).bool()
mycrf.transition.data[:5, :5] = model.transitions.data  ## transitions between labels
mycrf.transition.data[5, :5] = model.start_transitions.data  ## transitions from the start tag
mycrf.transition.data[:5, 6] = model.end_transitions.data  ## transitions to the end tag
unlabeled, labeled = mycrf(converted_emissions, torch.LongTensor([3, 3]), converted_tags, mask)
loglikelihood = labeled - unlabeled
print(loglikelihood)
Thanks for the example. I'll try it and update you on the result.
I am slightly confused about computing the probability of the most likely sequence and would appreciate your help.
Viterbi score to probability: as per my understanding, it is np.exp(viterbi_score - z), where both viterbi_score and z are in log space.
If I change my dataloader's batch_size, the value of z changes, since it depends on the batch (input). The problem is that I then can't use batch_size > 1, which is not efficient.
Intuitively, the probability of a sequence should be the same given the same model parameters, irrespective of batch_size.
Is my reasoning correct?
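As a minimal sketch of the conversion described above (the numbers here are made up purely for illustration; in practice viterbi_score and z come from the decoder and the forward pass):

```python
import torch

# Made-up per-instance values for a batch of 2, both in log space.
# Note viterbi_score <= z always, since the best path's score cannot
# exceed the logsumexp over all paths.
viterbi_score = torch.tensor([-3.2, -5.1])  # best-path score per instance
log_z = torch.tensor([-2.9, -4.0])          # log partition function per instance

# Probability of the most likely sequence, one value per instance.
prob_best = torch.exp(viterbi_score - log_z)
print(prob_best)
```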
Not really. Your viterbi score is always of size batch_size, and so is z. So even after you take the exponential, you still have a probability vector of size batch_size. The size of your batch never affects the value for each individual instance.
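A small self-contained sketch (a toy batched forward algorithm, not the repo's code) illustrates that z is computed per instance and is unaffected by the other sequences in the batch:

```python
import torch

torch.manual_seed(0)
num_tags, seq_len = 4, 5
transitions = torch.randn(num_tags, num_tags)  # shared transition scores

def log_z(emissions_batch):
    """Batched forward algorithm in log space.
    emissions_batch: (batch, seq_len, num_tags).
    Returns one log-partition value per instance, shape (batch,)."""
    alpha = emissions_batch[:, 0]  # (batch, num_tags)
    for t in range(1, emissions_batch.size(1)):
        # alpha_new[b, j] = logsumexp_i(alpha[b, i] + trans[i, j]) + emit[b, t, j]
        alpha = torch.logsumexp(alpha.unsqueeze(2) + transitions, dim=1) + emissions_batch[:, t]
    return torch.logsumexp(alpha, dim=1)

batch = torch.randn(3, seq_len, num_tags)
z_batched = log_z(batch)      # computed with batch_size = 3
z_single = log_z(batch[0:1])  # same first instance, batch_size = 1
assert torch.allclose(z_batched[0], z_single[0], atol=1e-6)
```

The first instance's z is identical whether it is processed alone or alongside two other sequences, which is the batch-size invariance described above.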
OK, I'll try to write a test case and validate.
As for the size of z: off the top of my head, I think it is a scalar, not a vector of size batch_size, which is what forces batch_size=1.
My mistake, I assumed the z values returned were of size batch_size, but forward_unlabeled already sums them before returning; here the last_alphas are the z values.
Maybe it would be a good idea to return the individual z values rather than sum(z), which would make the forward_unlabeled API more useful, e.g., for computing probabilities.
Again, thanks for the help.
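A sketch of what returning per-instance z values might look like (the function name and shapes here are hypothetical illustrations, not the repo's actual API):

```python
import torch

def forward_unlabeled_per_instance(last_alphas):
    """Hypothetical sketch of the suggested change: return log Z per
    instance instead of the summed scalar. last_alphas holds the final
    forward variables in log space, shape (batch_size, label_size)."""
    return torch.logsumexp(last_alphas, dim=1)  # shape: (batch_size,)

# Dummy final forward variables for a batch of 2 and 5 labels.
last_alphas = torch.randn(2, 5)
per_z = forward_unlabeled_per_instance(last_alphas)
loss_term = per_z.sum()  # the current summed value is trivially recovered
# per-instance probabilities would then follow as exp(viterbi_score - per_z)
```

Since the summed scalar is just per_z.sum(), the loss computation stays one line, while callers gain access to the per-instance partition values.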
Yeah, it seems that might make the code more readable. I will see how I can do that.