story2personality's Issues

MBTI labels for characters

Hi,
Sorry to bother you again.
According to the paper, your work provides a dataset that includes MBTI labels for characters, but I can't find them in the currently published version. Has this part not been released yet?

parse.py running problem

Hello, I am interested in your dataset, so I tried running your code, but I ran into the problem shown below.

(base) byeongjuncho@2080C:/data5/byeongjuncho2/Personality/Story2Personality$ python parse.py 
[*] Columns in documents.csv: ['document_id', 'set', 'kind', 'story_url', 'story_file_size', 'wiki_url', 'wiki_title', 'story_word_count', 'story_start', 'story_end']
Analyze segmentation signals: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████| 433/433 [00:43<00:00,  9.96it/s]
[*] Segmentation Analysis Results
  - Success: 300
  - Fail: 121
  - # of movies: 433
Segmenting:   0%|                                                                                                                                         | 0/433 [00:00<?, ?it/s][*] 00ee9e01a0e581e0d8cbf7e865a895147c480c5e:  # of scenes = 0,  # of dialogs = 0
[*] 00f9dbb0a851bc6099d5216e5fa8719b2ac3b82b:  # of scenes = 0,  # of dialogs = 0
[*] 020773a0ca71155173ec4affe6a2496a6cb45216:  # of scenes = 0,  # of dialogs = 0
[*] 032bcfd170a98fb5ed752c1d25b678b8589de7d7:  # of scenes = 0,  # of dialogs = 0
[*] 0434145d9284423a36a714fc55246ed0bdc39a82:  # of scenes = 0,  # of dialogs = 0
Segmenting:   2%|██▉                                                                                                                             | 10/433 [00:00<00:04, 92.47it/s][*] 04dc9134485dee27252734f24c2a3b3dd397b8a4:  # of scenes = 0,  # of dialogs = 0
[*] 050f88f6c8fed44ecbd7658b0f450fb705ce368d:  # of scenes = 0,  # of dialogs = 0
[*] 08373775c3d82d8bbf4ff286a3a0076a3627c652:  # of scenes = 0,  # of dialogs = 0
[*] 087c29795e06ed5b8e29361606c12b6390db9270:  # of scenes = 0,  # of dialogs = 0
[*] 0a93e857113efca05c6274e7af3ba4f03a023b9f:  # of scenes = 0,  # of dialogs = 0
[*] 0b9f563bad33316d94f4f339fb151a4b33f25c3f:  # of scenes = 0,  # of dialogs = 0
Segmenting:   5%|██████▏                                                                                                                        | 21/433 [00:00<00:04, 100.67it/s][*] 0bd270c8c46f84a4abd99d65e2f17a9e11a7f76d:  # of scenes = 0,  # of dialogs = 0
[*] 0c112bc4c07ded9222ae308d1f20591b76aed4de:  # of scenes = 0,  # of dialogs = 0
[*] 0cb3433c5ac030dba47414e2655c3c49e4f37527:  # of scenes = 0,  # of dialogs = 0
[*] 0eae0c4823bd37d58a7b23798c37fd0818bad032:  # of scenes = 0,  # of dialogs = 0
[*] 0edab258db9c1e0483eeaa442e0522bdefa9d445:  # of scenes = 0,  # of dialogs = 0
[*] 0fe1c42d024d1d3f001a9aadb400512ecbda2b9f:  # of scenes = 0,  # of dialogs = 0
[*] 0ffdd0e4abb67f0012a68d719a2d509ee8ec643d:  # of scenes = 0,  # of dialogs = 0
[*] 117ba539f2d311e3067064dca3b5fd68a7b95ee3:  # of scenes = 0,  # of dialogs = 0
Segmenting:   7%|█████████▍                                                                                                                      | 32/433 [00:00<00:04, 94.57it/s][*] 121c28edd4b0474f2386d9290dcee15b5352e006:  # of scenes = 0,  # of dialogs = 0
[*] 139004cd2411c6909a752ecf897d326ed8bf419c:  # of scenes = 0,  # of dialogs = 0
[*] 141a5885bb6d8df56dadbd617b5dec7db102701d:  # of scenes = 0,  # of dialogs = 0
[*] 145200abf14baeffa646797dfbfa58861cb4b079:  # of scenes = 0,  # of dialogs = 0
[*] 14add440692bd1e8fbb8c2df367ba4cb27426383:  # of scenes = 0,  # of dialogs = 0
[*] 150683ab8156bdc78ebd23a2ed7f7e265b780bd0:  # of scenes = 0,  # of dialogs = 0
[*] 1508452704b5829941b98aeee2eeba192f5beae5:  # of scenes = 0,  # of dialogs = 0
[*] 1625d7038823e0ca81c39675c3ea7fe31f7fbe04:  # of scenes = 0,  # of dialogs = 0
[*] 16a6a06abf4ff93fe3143b66c45dbeae6550816b:  # of scenes = 0,  # of dialogs = 0
Segmenting:  10%|████████████▍                                                                                                                   | 42/433 [00:00<00:04, 86.93it/s][*] 173b52e21786674aef19e640355e4a6fc869b34e:  # of scenes = 0,  # of dialogs = 0
[*] 18bcdf16fb896faf7fb36db6b752f16f0343ad24:  # of scenes = 0,  # of dialogs = 0
[*] 197cf28c98cf416fed651b7c569073249b37f855:  # of scenes = 0,  # of dialogs = 0
[*] 1995b44b685baf07ae17d56bb810ab8295785010:  # of scenes = 0,  # of dialogs = 0
[*] 199ec80c97f752cd8e00d2ca6f8e5f48603fefb8:  # of scenes = 0,  # of dialogs = 0
[*] 19dd268e42d372d73072c40bc3b8d1c46e07a3a2:  # of scenes = 0,  # of dialogs = 0
[*] 1a020bc35fe8ff93de0a570f7cf70e958880241c:  # of scenes = 0,  # of dialogs = 0
[*] 1a447099edd43fe60f1bf8f2b1eabbd11a5c3989:  # of scenes = 0,  # of dialogs = 0
[*] 1a7119c0cbb4a82b93913600187c98fc9dfa52b0:  # of scenes = 0,  # of dialogs = 0
Segmenting:  12%|███████████████▎                                                                                                                | 52/433 [00:00<00:04, 88.01it/s][*] 1ae015eaa8cd3b66136578e947afc72ed338fe10:  # of scenes = 0,  # of dialogs = 0
[*] 1b48982220584874f6abed587219f6d73e85abd8:  # of scenes = 0,  # of dialogs = 0
[*] 1b8ca5c74e79ccf56eee42155d467c89c8445019:  # of scenes = 0,  # of dialogs = 0
[*] 1da161b6007d46d85eda16ae391e1e78218fabd2:  # of scenes = 0,  # of dialogs = 0
[*] 1ddea1c3f13183fbe029834b5a3ab505600c5dfa:  # of scenes = 0,  # of dialogs = 0
[*] 1f8e03c7b6a6864108933fba1906455f78e4cfa6:  # of scenes = 0,  # of dialogs = 0
[*] 21140bb7b7f2e0c9ccadf799b60b577685c42647:  # of scenes = 0,  # of dialogs = 0
[*] 2114ee4681976de8a9d75d0e411ccc47bfa1caa3:  # of scenes = 0,  # of dialogs = 0
Segmenting:  14%|██████████████████                                                                                                              | 61/433 [00:00<00:04, 87.76it/s][*] 214cb5277750c1ccddec0b10fa545c3c78c45f64:  # of scenes = 0,  # of dialogs = 0
[*] 21813f2122fca16d05e89d44f4521f7da8a8f3b7:  # of scenes = 0,  # of dialogs = 0
[*] 2453d062843edc379bdae3be69859e18bf1abd9d:  # of scenes = 0,  # of dialogs = 0
[*] 266f5b2295980ad31b5090a6c51b69055c87b3a7:  # of scenes = 0,  # of dialogs = 0
[*] 26b13fb4ac397ec6d550e209c9979116de71f467:  # of scenes = 0,  # of dialogs = 0
[*] 26da72cfd563728fc90c2a808a9c6f076fa4e815:  # of scenes = 0,  # of dialogs = 0
Segmenting:  16%|████████████████████▋                                                                                                           | 70/433 [00:00<00:04, 82.10it/s][*] 27ad3a0a84489109ca5d3cc4bf3b8b74a7b8ed9a:  # of scenes = 0,  # of dialogs = 0
[*] 2ba38d31381278d2ae8e90a2a3108debaef877bb:  # of scenes = 0,  # of dialogs = 0
[*] 2c7a9ffe4c06039bd8fee47b6235443267863686:  # of scenes = 0,  # of dialogs = 0
[*] 2e15003e58afe663f66622edbfc3d7d1c7c69a60:  # of scenes = 0,  # of dialogs = 0
[*] 2e24e035fddb9b96430c9bb2cf45d44facac6fc8:  # of scenes = 0,  # of dialogs = 0
[*] 2fc2cfddbe1ca72a7c2ae750a6c8b656dad60f9b:  # of scenes = 0,  # of dialogs = 0
Segmenting:  19%|███████████████████████▉                                                                                                        | 81/433 [00:00<00:03, 88.45it/s][*] 2fe5752889751cbb4bbeab3deba4f6e315f90f0a:  # of scenes = 0,  # of dialogs = 0
[*] 307f5583cf63457100b2d3f616d669e03a501196:  # of scenes = 0,  # of dialogs = 0
[*] 317fe46b2903da739b059733cd618db6ca49a494:  # of scenes = 0,  # of dialogs = 0
[*] 324138e357569216a5fae897cea8cba4548626c4:  # of scenes = 0,  # of dialogs = 0
[*] 33852b452ef4d43a9aa4afbeb543b6a3107e7209:  # of scenes = 0,  # of dialogs = 0
[*] 341bfd64f674b6410af0efd4fe9ab0c5c728d82d:  # of scenes = 0,  # of dialogs = 0
[*] 35891de62bab83d5b312ddeb835c7e0b245e3282:  # of scenes = 0,  # of dialogs = 0
[*] 35ae42f3419e73b7c7357c222e494eaa4161026b:  # of scenes = 0,  # of dialogs = 0
[*] 361364f57460139410c4130a1e7a58caf152c2bd:  # of scenes = 0,  # of dialogs = 0
Segmenting:  21%|███████████████████████████▍                                                                                                    | 93/433 [00:01<00:03, 95.72it/s][*] 3634471ed994ee7d4f382d8e7edbc56de5c28c42:  # of scenes = 0,  # of dialogs = 0
[*] 375d6609a90580a1ce888d45ac697464e6870010:  # of scenes = 0,  # of dialogs = 0
[*] 37b90142ab00baef4ece1ee4896b8933cd8cdb61:  # of scenes = 0,  # of dialogs = 0
[*] 386e3ca25d1aabc56f5a7eaf9714badb8ec86382:  # of scenes = 0,  # of dialogs = 0
[*] 38e24416d39a0a285ef1693adad25c9ed0c94487:  # of scenes = 0,  # of dialogs = 0
[*] 399f3571af53d150a999da5553de856ddb0815b9:  # of scenes = 0,  # of dialogs = 0
[*] 39c9fc154b2cc4030d2732fab674fc246eb28aed:  # of scenes = 0,  # of dialogs = 0
Segmenting:  24%|██████████████████████████████▏                                                                                                | 103/433 [00:01<00:03, 88.35it/s][*] 3a82541bb17890577626925533f6549efe5cf08e:  # of scenes = 0,  # of dialogs = 0
[*] 3b2427be8d3bbfe55590d1ce307f778ede5bbcd9:  # of scenes = 0,  # of dialogs = 0
[*] 3c78ea07497de438f4fcddc0d49051a7a6fa5490:  # of scenes = 0,  # of dialogs = 0
[*] 3d248aa8bba34b3f5199c1aed1443b9fa3395d03:  # of scenes = 0,  # of dialogs = 0
[*] 3e87eee49ffe1cbcc871f2310489775ffcaccbb2:  # of scenes = 0,  # of dialogs = 0
[*] 3feb46d105b7ef3bdb248064761dc309c1831466:  # of scenes = 0,  # of dialogs = 0
[*] 403479540337f035a011f0ea84b4770e31e18207:  # of scenes = 0,  # of dialogs = 0
Segmenting:  26%|████████████████████████████████▊                                                                                              | 112/433 [00:01<00:03, 85.00it/s][*] 407624e45c153cdd848f539d37ca8b5d448f9580:  # of scenes = 0,  # of dialogs = 0
[*] 40ae9851099e8fc4d00c6d3fed1549ae50a75bd0:  # of scenes = 0,  # of dialogs = 0
[*] 411d53d3c5c42990aebb7cb0bf4964f8d0a6f0bc:  # of scenes = 0,  # of dialogs = 0
[*] 42d253275a8807aa6ecf57c6c306cb24d76710f1:  # of scenes = 0,  # of dialogs = 0
[*] 43059307f3d694292ba9178da8fc3e1fe470531f:  # of scenes = 0,  # of dialogs = 0
[*] 43c4885ee70fb0f0871001cbe555520bdf2c3375:  # of scenes = 0,  # of dialogs = 0
[*] 43fce60036da73632775421436a2c2a1f970c755:  # of scenes = 0,  # of dialogs = 0
[*] 447d97a7439de3811d9b6f4dfd5685e09f5fb727:  # of scenes = 0,  # of dialogs = 0
[*] 44cd7b845627af65bf84e6ce3bfc8cfab5bf7b7c:  # of scenes = 0,  # of dialogs = 0
[*] 45c9fe33a1c1348c3ba212834bd2807100c6721a:  # of scenes = 0,  # of dialogs = 0
Segmenting:  28%|███████████████████████████████████▊                                                                                           | 122/433 [00:01<00:03, 88.10it/s][*] 46381add305c73e6d4625548324615b11dfb25c8:  # of scenes = 0,  # of dialogs = 0
[*] 46a5ee8bbf57f56ad5472e0712e98370b734145d:  # of scenes = 0,  # of dialogs = 0
[*] 46c83ab910bbf794c07735c7d55af442a54d090b:  # of scenes = 0,  # of dialogs = 0
[*] 48266045f2dbe4de0cea552d3ec8ffb541c5e182:  # of scenes = 0,  # of dialogs = 0
[*] 492f4b276ebddc0e391f1ce39201849e38a2df20:  # of scenes = 0,  # of dialogs = 0
[*] 49981df2afcb8a656e10eb87b3acd859783eb046:  # of scenes = 0,  # of dialogs = 0
[*] 4a1fff119d01e5ede3da8b3c13c3925484720533:  # of scenes = 0,  # of dialogs = 0
Segmenting:  30%|██████████████████████████████████████▋                                                                                        | 132/433 [00:01<00:03, 90.09it/s][*] 4dcfd38222f416db7aa1add3ecd85e2c9d6160d1:  # of scenes = 0,  # of dialogs = 0
[*] 4f17e590323c45bf12f789e5990d6ab90698671a:  # of scenes = 0,  # of dialogs = 0
[*] 4f485054f9d450534fddba184f0996e32575d1be:  # of scenes = 0,  # of dialogs = 0
[*] 5041b6dbfc48abb92a0c118fcb358a3da92bef34:  # of scenes = 0,  # of dialogs = 0
[*] 5170566c90131e41863b393cc12d1df9f9c17cc4:  # of scenes = 0,  # of dialogs = 0
[*] 51f52296a99ce50b6ca69aa5939d81a5aea4e042:  # of scenes = 0,  # of dialogs = 0
[*] 523a2eb1ae686d7bf0e664c89d0a490a7e1a22bc:  # of scenes = 0,  # of dialogs = 0
[*] 5339e9db4aca0b74b644e736274989344864f0ba:  # of scenes = 0,  # of dialogs = 0
Segmenting:  33%|██████████████████████████████████████████▌                                                                                    | 145/433 [00:01<00:02, 99.48it/s][*] 55d1940b8c8e1c73e175bac2fce0a4f9844fff02:  # of scenes = 0,  # of dialogs = 0
[*] 567c1a39aff9590875a843a9df35f8c5b880ae27:  # of scenes = 0,  # of dialogs = 0
[*] 5685121eb6af89082a190b3383cdee15a2ae83ff:  # of scenes = 0,  # of dialogs = 0
[*] 570c8d99794d019e34b3ead4b4fcf80bb0fac459:  # of scenes = 0,  # of dialogs = 0
[*] 5806789ebca66b68e86317d0e09b8a433c236280:  # of scenes = 0,  # of dialogs = 0
[*] 5ad9844f125d7051ba23edbb8b48a74f4f6102c8:  # of scenes = 0,  # of dialogs = 0
[*] 5c1d04428ffd3f0ddd732a723e61371e0255aa49:  # of scenes = 0,  # of dialogs = 0
Segmenting:  36%|█████████████████████████████████████████████▊                                                                                 | 156/433 [00:01<00:02, 96.32it/s][*] 5e68bb2ae2af335f5f828966209fb6e97621005f:  # of scenes = 0,  # of dialogs = 0
[*] 5ea4ebbc0d86c3f629932a6f2470949776e58579:  # of scenes = 0,  # of dialogs = 0
[*] 5f4f9df9f707d382b33c73db83b316a60599a6ea:  # of scenes = 0,  # of dialogs = 0
[*] 5fd696aa16bb2f03de7af3be71ca9047d3a82935:  # of scenes = 0,  # of dialogs = 0
[*] 6031da6fa93ad9cac1b6da6586010aab81c7b4da:  # of scenes = 0,  # of dialogs = 0
[*] 60ea304651d0be17e4d8e51fa1cb896437021d6a:  # of scenes = 0,  # of dialogs = 0
Segmenting:  38%|████████████████████████████████████████████████▋                                                                              | 166/433 [00:01<00:02, 91.46it/s][*] 634abf83f11ee4449864e1c391b0456b3fdbdf64:  # of scenes = 0,  # of dialogs = 0
[*] 64e583bde7ea2b98c40e03756a32ce31be036af2:  # of scenes = 0,  # of dialogs = 0
[*] 65fdaaca567f74e28159d1c6a7b6cfeec34316ff:  # of scenes = 0,  # of dialogs = 0
[*] 677cda46d079c6df650914eda8d9da1dcda8bf8d:  # of scenes = 0,  # of dialogs = 0
[*] 67ca24897196449395daa9886c7fbaceab55c964:  # of scenes = 0,  # of dialogs = 0
[*] 681b019e94056cbe4f7a13c8323eae0a65ebdbec:  # of scenes = 0,  # of dialogs = 0
[*] 683105161212ee6e3f95eb2857623ad77b088af9:  # of scenes = 0,  # of dialogs = 0
[*] 68ee401e0c66832834f605b625d5062b06a59515:  # of scenes = 0,  # of dialogs = 0
Segmenting:  41%|███████████████████████████████████████████████████▌                                                                           | 176/433 [00:01<00:02, 90.49it/s][*] 69099d7d543fad22f61d8acf97c681c9c86cac0e:  # of scenes = 0,  # of dialogs = 0
[*] 6a02d46e87865ba5b033c56c658af2bfdd182093:  # of scenes = 0,  # of dialogs = 0
[*] 6a14d1ba1aa1a6719840cb98a8feb6a711291aa4:  # of scenes = 0,  # of dialogs = 0
[*] 6a2f59a81c3730b5e1d5f685bac8bbbe547b7b3b:  # of scenes = 0,  # of dialogs = 0
[*] 6aa4c6be53eff0024e5c84f99ac94cddff4eb8f0:  # of scenes = 0,  # of dialogs = 0
[*] 6bec7c2bdd0b01296bc9020288d833709e54cd52:  # of scenes = 0,  # of dialogs = 0
[*] 6c2c91cc1e5aa0a597e6c5c0d7029e8672c0f60d:  # of scenes = 0,  # of dialogs = 0
[*] 6d2e1b95ed2a00e16e046b6f3d0b03e687c7f7f7:  # of scenes = 0,  # of dialogs = 0
Segmenting:  43%|██████████████████████████████████████████████████████▌                                                                        | 186/433 [00:02<00:02, 85.65it/s][*] 6d925e92824fabd7513cd864687c29d6ee3e5c2d:  # of scenes = 0,  # of dialogs = 0
[*] 6ed5d860df4edd3d32f1aca52ec369f0d21039a0:  # of scenes = 0,  # of dialogs = 0
[*] 6ffb5d981d101fbbd43d97bc45720388570f2c61:  # of scenes = 0,  # of dialogs = 0
[*] 70794150f324949ca49f182db0d3f8d69d0c779e:  # of scenes = 0,  # of dialogs = 0
[*] 70b3f55e376d461b1cb7dc5005c02091478e2e44:  # of scenes = 0,  # of dialogs = 0
[*] 71ce19cf034c830c1e2d8b98682ca1d53ead1067:  # of scenes = 0,  # of dialogs = 0
[*] 7235e7853e8ea8ba99b6d7a386d8de03b3887ace:  # of scenes = 0,  # of dialogs = 0
Segmenting:  45%|█████████████████████████████████████████████████████████▊                                                                     | 197/433 [00:02<00:02, 89.93it/s][*] 7415641a4aa3cc0b71657573197bfc9d48694e03:  # of scenes = 0,  # of dialogs = 0
[*] 78e0e28b686c7157d598082bfaa8aaaab821b78b:  # of scenes = 0,  # of dialogs = 0
[*] 7b85063db83a2ba9cba23510c61d769626b604a3:  # of scenes = 0,  # of dialogs = 0
[*] 7c41a74fb0e568bd5fc658282facbf79bf541271:  # of scenes = 0,  # of dialogs = 0
[*] 7c531dc41880b96b8acfaf66a39a85b91d68b636:  # of scenes = 0,  # of dialogs = 0
[*] 7cce97e00d2de0f9c3a19e5ac5c05f6449a77642:  # of scenes = 0,  # of dialogs = 0
Segmenting:  48%|████████████████████████████████████████████████████████████▋                                                                  | 207/433 [00:02<00:02, 86.59it/s][*] 7ef394b11230baf81f61e790839fa993c6ea1f72:  # of scenes = 0,  # of dialogs = 0
[*] 7f1cb0e615795ed6be5d96ea3f13ce62921d8835:  # of scenes = 0,  # of dialogs = 0
[*] 7f4bce4058ac3cf2ff5cfb36908c8e6d891797e1:  # of scenes = 0,  # of dialogs = 0
[*] 81d2ece8e55ac0a2799aa87a43117f01bbd7506d:  # of scenes = 0,  # of dialogs = 0
[*] 81db4f3e5fe29c02fd7b7a702aa4847db6a04613:  # of scenes = 0,  # of dialogs = 0
Segmenting:  50%|███████████████████████████████████████████████████████████████▋                                                               | 217/433 [00:02<00:02, 88.76it/s][*] 83a1fd492021ceb110451d65888072daf64a5d4f:  # of scenes = 0,  # of dialogs = 0
[*] 84327ef84b778b11993de1d2e3f6fb04eeb09fff:  # of scenes = 0,  # of dialogs = 0
[*] 84732f85b51dfbfed6c40f2bc1e35e1697eade8e:  # of scenes = 0,  # of dialogs = 0
[*] 8501cab146742babe04bc3984eb34409d97078a1:  # of scenes = 0,  # of dialogs = 0
[*] 854dd2f347a89b5653a9e8372541af6fc3590254:  # of scenes = 0,  # of dialogs = 0
[*] 85591366e6f0ab31815c40d55e1d4d5182d20ec5:  # of scenes = 0,  # of dialogs = 0
Segmenting:  52%|██████████████████████████████████████████████████████████████████▎                                                            | 226/433 [00:02<00:02, 89.07it/s][*] 88545fa9a5ab807238066a7ab8867ac3adbcd03b:  # of scenes = 0,  # of dialogs = 0
[*] 88cce939b62c833842ccfc1e0fa7534288626c86:  # of scenes = 0,  # of dialogs = 0
[*] 8928a603a3c6d154ea4547060805a966ce0f4d60:  # of scenes = 0,  # of dialogs = 0
[*] 8b1ac9ad821c24ab658e4977aa169ff195f4967f:  # of scenes = 0,  # of dialogs = 0
[*] 8b9da16420edd653ae5e0e2925dd3cade3a21ebc:  # of scenes = 0,  # of dialogs = 0
[*] 8cea29a5fd9e324e9ff07ab2e4a1521d591cb318:  # of scenes = 0,  # of dialogs = 0
[*] 8dddfb1c4aa33c5821670ba20549ec02aba73056:  # of scenes = 0,  # of dialogs = 0
Segmenting:  55%|█████████████████████████████████████████████████████████████████████▌                                                         | 237/433 [00:02<00:02, 93.57it/s][*] 91075ab5383e8113d86b17536c6918bfbbd6af20:  # of scenes = 0,  # of dialogs = 0
[*] 91293cd81f021de45cffd363ef81dd95d2c122a3:  # of scenes = 0,  # of dialogs = 0
[*] 916835cb4bcb3baa6333e7cca25bef7710dbdcbc:  # of scenes = 0,  # of dialogs = 0
[*] 92134f0c9dc82e7b2cc9afd5896ae8dc7d6d088e:  # of scenes = 0,  # of dialogs = 0
[*] 9292406d5193d3402195df7e5647fd168da2d15a:  # of scenes = 0,  # of dialogs = 0
[*] 93116caf60209c52bca7cd0b51ccc366eb90f6c6:  # of scenes = 0,  # of dialogs = 0
[*] 94f1c8eb8ce7f271eb52c6ae9071ae1b56dabfcb:  # of scenes = 0,  # of dialogs = 0
Segmenting:  58%|████████████████████████████████████████████████████████████████████████▋                                                     | 250/433 [00:02<00:01, 102.04it/s][*] 95315365bf6b07073562333fa15403f68e040e70:  # of scenes = 0,  # of dialogs = 0
[*] 953a454c146650e09d27fdb4dbe3cfdc6a9ca697:  # of scenes = 0,  # of dialogs = 0
[*] 987d3e31dc8b6029a2b9450243e122c8862bbc24:  # of scenes = 0,  # of dialogs = 0
[*] 9990281afdfbf883d53ac7540acfcbb375380f75:  # of scenes = 0,  # of dialogs = 0
[*] 999a532b45f030c8f382f0f92fc51b3d12fd821c:  # of scenes = 0,  # of dialogs = 0
[*] 9a1f8183b31ba8ab9f1719fa42b36a173035beb4:  # of scenes = 0,  # of dialogs = 0
[*] 9c2bb97cbbb8dca3fb1d85fbe1abeb27ad046615:  # of scenes = 0,  # of dialogs = 0
Segmenting:  60%|████████████████████████████████████████████████████████████████████████████▌                                                  | 261/433 [00:02<00:01, 90.42it/s][*] 9cbe9d08ff6673e8dba308ac11ba88d71b425209:  # of scenes = 0,  # of dialogs = 0
[*] 9cd25b973d253386eaebbb7f2f7821dc7518f6d6:  # of scenes = 0,  # of dialogs = 0
[*] 9d8b79107628c4dedae91fa3648e56ae94b997fc:  # of scenes = 0,  # of dialogs = 0
[*] 9d8ddfe86beba149b3463eeb1bae92919e179fed:  # of scenes = 0,  # of dialogs = 0
[*] 9e1475e4ee95eb7b4dca656e5afde2aaf5fdda8a:  # of scenes = 0,  # of dialogs = 0
[*] 9ebb84bdc9cc6d698ccc331437bd1ec3b5f0dddb:  # of scenes = 0,  # of dialogs = 0
[*] 9f83c8e49f5a53b211caf37cbdc659f97d2ef30a:  # of scenes = 0,  # of dialogs = 0
Segmenting:  63%|███████████████████████████████████████████████████████████████████████████████▊                                               | 272/433 [00:02<00:01, 93.21it/s][*] a13cfb713f7eca7e750b8ad20b946b142aaa5dbf:  # of scenes = 0,  # of dialogs = 0
[*] a18e921ea3e947642147754dffa769f9eabb31e7:  # of scenes = 0,  # of dialogs = 0
[*] a32f354788dd1f1411b9745200cb330d9b556373:  # of scenes = 0,  # of dialogs = 0
[*] a34a30243522a835f826fe5269b4d234c5443ff3:  # of scenes = 0,  # of dialogs = 0
[*] a3d22e30a6afde892a65e16db0454093a232da87:  # of scenes = 0,  # of dialogs = 0
[*] a3f5043d31f3d18b625f75f69392834d4479df38:  # of scenes = 0,  # of dialogs = 0
Segmenting:  65%|██████████████████████████████████████████████████████████████████████████████████▋                                            | 282/433 [00:03<00:01, 90.78it/s][*] a493bc348052f6962bb423ad44b2e13134d635fc:  # of scenes = 0,  # of dialogs = 0
[*] a65c4ede45179964de0abc38a88aecd767eaf505:  # of scenes = 0,  # of dialogs = 0
[*] a69fee0515cdb067fec5c42fa88ace5c9639118c:  # of scenes = 0,  # of dialogs = 0
[*] a8549480950ac906c9426b7d8cb7963e52e4cd6c:  # of scenes = 0,  # of dialogs = 0
[*] a8771e0706c5b98b55c60b8dc3b9668295315712:  # of scenes = 0,  # of dialogs = 0
[*] a87d47ee243b5cc4f8ce3acf483ccf9e77083e16:  # of scenes = 0,  # of dialogs = 0
Segmenting:  67%|█████████████████████████████████████████████████████████████████████████████████████▋                                         | 292/433 [00:03<00:01, 86.28it/s][*] a912a7308035d8c7136ca7439bce977ed57951f4:  # of scenes = 0,  # of dialogs = 0
[*] acdeff3068372e962ae2600b4883e5cd62c04150:  # of scenes = 0,  # of dialogs = 0
[*] ad0ec287e1d4d99b7ed905a7765e56f79eecb9ef:  # of scenes = 0,  # of dialogs = 0
[*] ad63a9aa1afa22bf51451da2e2f45d9543c9ca62:  # of scenes = 0,  # of dialogs = 0
[*] add8e2efd820ef2778c746e2d3d6c0cd8c650672:  # of scenes = 0,  # of dialogs = 0
[*] aecd2050091b2c8f57190bc9a1c0a1d81d6b2f56:  # of scenes = 0,  # of dialogs = 0
[*] b106547216886fe269040370774d80b5a5a53318:  # of scenes = 0,  # of dialogs = 0
[*] b1b67dc8de94763e66312262449f6cd55956f1f0:  # of scenes = 0,  # of dialogs = 0
Segmenting:  70%|████████████████████████████████████████████████████████████████████████████████████████▌                                      | 302/433 [00:03<00:01, 87.40it/s][*] b236a647b02cd1b003edbf96ab501e8d6aea6c6d:  # of scenes = 0,  # of dialogs = 0
[*] b31c8b60d28467403c1615ec883b167a0c103835:  # of scenes = 0,  # of dialogs = 0
[*] b4010f552ee32bf8b2f4dca248f176b0bf2602b1:  # of scenes = 0,  # of dialogs = 0
[*] b5038cd75a0f275ec87cd993eba3c2af3731bc6c:  # of scenes = 0,  # of dialogs = 0
[*] b50c5113b46d135015347f095947181fbe5fe30f:  # of scenes = 0,  # of dialogs = 0
[*] b598355c6a88e44b884cac7c54b9c955af552e7e:  # of scenes = 0,  # of dialogs = 0
[*] b61a35a3025b1b738c257af2a2b5af9a52222304:  # of scenes = 0,  # of dialogs = 0
[*] b6ed74a969cdbb5ddc34575df8b7ee9d955d5556:  # of scenes = 0,  # of dialogs = 0
Segmenting:  72%|███████████████████████████████████████████████████████████████████████████████████████████▊                                   | 313/433 [00:03<00:01, 91.20it/s][*] b833410a8ff29952ca664319cb3462f2fd07d4f9:  # of scenes = 0,  # of dialogs = 0
[*] b961b2d182f89c9c4259ffd8001df871ead32ef0:  # of scenes = 0,  # of dialogs = 0
[*] ba74352102b49e66de4485552e80012fe38adb20:  # of scenes = 0,  # of dialogs = 0
[*] bd14fef15878fdac1e9c2d2dbe52df0951f38aad:  # of scenes = 0,  # of dialogs = 0
[*] bddb5d539d43bd3d63ca9ff7266e424eb5854521:  # of scenes = 0,  # of dialogs = 0
[*] bde72e62aabedf831fc81b5847152a244fd97bee:  # of scenes = 0,  # of dialogs = 0
[*] bf2e4b5bd8d06325ffe3fcee4af99b9d95ee34e4:  # of scenes = 0,  # of dialogs = 0
[*] bf438dd002b209fa4550cd56752c6549428fc4bc:  # of scenes = 0,  # of dialogs = 0
[*] bfd50b1cf73709308ab4ad727d829c5fca23480c:  # of scenes = 0,  # of dialogs = 0
[*] c002791c4ff779710ca7ba8d8cde2ac4b27d28b3:  # of scenes = 0,  # of dialogs = 0
Segmenting:  75%|██████████████████████████████████████████████████████████████████████████████████████████████▋                                | 323/433 [00:03<00:01, 91.00it/s][*] c0e05b3aac173e39064f07ccb60fb0a30430f824:  # of scenes = 0,  # of dialogs = 0
[*] c2e872845e99f82ddfa5a3ed84f985a04b6729e6:  # of scenes = 0,  # of dialogs = 0
[*] c3d05fedec86ea11bb70837d54b941589bde4d88:  # of scenes = 0,  # of dialogs = 0
[*] c5384ac7f6a3e69a17ede247235936b934a71a03:  # of scenes = 0,  # of dialogs = 0
[*] c551edfbb8240501afd55b17495674b9a04060e3:  # of scenes = 0,  # of dialogs = 0
Segmenting:  77%|█████████████████████████████████████████████████████████████████████████████████████████████████▋                             | 333/433 [00:03<00:01, 93.45it/s][*] c65f47d3de4510d418357b1f133d3171f0bc4eca:  # of scenes = 0,  # of dialogs = 0
[*] c72161c89a7dc8ea5d62b200689bd2acae6f354d:  # of scenes = 0,  # of dialogs = 0
[*] c7c075c49018828bf6027da5c5534834779d1adf:  # of scenes = 0,  # of dialogs = 0
[*] c96a12df9414dca862b7f1d5882dadb40e152121:  # of scenes = 0,  # of dialogs = 0
[*] cb456274929bf53ca118acacea7175e9f25c99bd:  # of scenes = 0,  # of dialogs = 0
[*] cbb15e9755f01a017965f239be1cd3b9277f69ed:  # of scenes = 0,  # of dialogs = 0
[*] cc49ee763ad73ea914a925f7dddf3687b77a69c9:  # of scenes = 0,  # of dialogs = 0
[*] ccde92fd5de1e67f17e10d9e0cd3375ce4efcf23:  # of scenes = 0,  # of dialogs = 0
Segmenting:  79%|████████████████████████████████████████████████████████████████████████████████████████████████████▌                          | 343/433 [00:03<00:00, 92.15it/s][*] ccdf8c8c07e95675fae3591714061ecacfd5ad2e:  # of scenes = 0,  # of dialogs = 0
[*] cd13eb380843224f66c34047cc06dc445a92f8fd:  # of scenes = 0,  # of dialogs = 0
[*] ce23d22d3c3b9297322ea9573f356987ecdeeeb6:  # of scenes = 0,  # of dialogs = 0
[*] ce8cb184a11535e7a7c824c82b7772a1c3a7c92c:  # of scenes = 0,  # of dialogs = 0
[*] cf873fb685ac6b1bd09c733ad9b0c0130d109454:  # of scenes = 0,  # of dialogs = 0
[*] d053fe3c068b8a68d07d7384056afd16935e608a:  # of scenes = 0,  # of dialogs = 0
[*] d079071fecafe63e2939d8e866c36819c21b907c:  # of scenes = 0,  # of dialogs = 0
[*] d094b01390b1598e80f7fd148ef0987e882f04ab:  # of scenes = 0,  # of dialogs = 0
Segmenting:  82%|███████████████████████████████████████████████████████████████████████████████████████████████████████▊                       | 354/433 [00:03<00:00, 95.08it/s][*] d15686cf4482b52351e990ccd991fefda8d2f6dc:  # of scenes = 0,  # of dialogs = 0
[*] d1960641caba4c85c372a2177e6727ad948c7005:  # of scenes = 0,  # of dialogs = 0
[*] d226b0c7fb662f93cc3298e3caa21212f59e0a36:  # of scenes = 0,  # of dialogs = 0
[*] d2e4700544066553a0c434d4861e9b7c4cdfbd7b:  # of scenes = 0,  # of dialogs = 0
[*] d48a25702aca65bfc7755c1dbcb5c196593af1ee:  # of scenes = 0,  # of dialogs = 0
[*] d5b32abd0fe5966b8c619084932c5d832d51f063:  # of scenes = 0,  # of dialogs = 0
[*] d66fe35ce1d4d1166add716e366c04a84618cabe:  # of scenes = 0,  # of dialogs = 0
Segmenting:  84%|██████████████████████████████████████████████████████████████████████████████████████████████████████████▊                    | 364/433 [00:04<00:00, 93.35it/s][*] d6c728cc9fabd2ef68dabc990731470f455e8fac:  # of scenes = 0,  # of dialogs = 0
[*] d75d97726ac6b229b809cb7e482606024c2e564a:  # of scenes = 0,  # of dialogs = 0
[*] d8844d709aa624a5ffe70f185dc68488839d37ea:  # of scenes = 0,  # of dialogs = 0
[*] d8aeeba694332530d1ca1647779c3228959aa20a:  # of scenes = 0,  # of dialogs = 0
[*] d984035756201895c1acd9233775031cc9c0a30c:  # of scenes = 0,  # of dialogs = 0
[*] d9df8732f4fad8d4ffa6d8b2f7af12ea374a5be2:  # of scenes = 0,  # of dialogs = 0
[*] dbb845a9465690011b39ffd4408c5a41db58d97a:  # of scenes = 0,  # of dialogs = 0
[*] dbc088fbc6dd9efb6b6b7d8821f73eb0f1759db4:  # of scenes = 0,  # of dialogs = 0
Segmenting:  87%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████▉                 | 375/433 [00:04<00:00, 95.51it/s][*] dc8d6c5a9a9cb0ee6cc3b47ed9aa7a6f6209d05e:  # of scenes = 0,  # of dialogs = 0
[*] dcd3468c3e18822f08ff0607edfb3e9ca06f3ef0:  # of scenes = 0,  # of dialogs = 0
[*] ddd55023a3dd6800331b50c560f74390f85f1e06:  # of scenes = 0,  # of dialogs = 0
[*] deae2c2c3964684550d73f691762da489f9782f7:  # of scenes = 0,  # of dialogs = 0
[*] dfea98678342e17dbfce44c7906602788cc2267c:  # of scenes = 0,  # of dialogs = 0
[*] e0c74cdf270ebe29a2139e7319fc7314738c88ee:  # of scenes = 0,  # of dialogs = 0
[*] e1c042c57411f230068ededfd7b27e44c0580700:  # of scenes = 0,  # of dialogs = 0
[*] e292eae2486862f6df6ff388cb2dd6777bc73f27:  # of scenes = 0,  # of dialogs = 0
Segmenting:  89%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████▉              | 385/433 [00:04<00:00, 94.18it/s][*] e3793c8072528b1d92b8614113e6d2b5748652cd:  # of scenes = 0,  # of dialogs = 0
[*] e4081796caf2b0354f1fc78626b7a74396979e5b:  # of scenes = 0,  # of dialogs = 0
[*] e60d35663e8d38fa4bde3bff0690ab2ca735fd74:  # of scenes = 0,  # of dialogs = 0
[*] e6ff33cf1eb66a9bcffffbeb0866ec6ccee7f3af:  # of scenes = 0,  # of dialogs = 0
[*] e80dcfbc4d200c173d6ac969a9b160a40a1edf70:  # of scenes = 0,  # of dialogs = 0
[*] ea5d07dd2150a3e4fd5199ab496074839a019ded:  # of scenes = 0,  # of dialogs = 0
[*] ea6f69c29b491c58796029a66f029e552db2819d:  # of scenes = 0,  # of dialogs = 0
Segmenting:  91%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████▊           | 395/433 [00:04<00:00, 90.31it/s][*] ec14a0a6712974227acf09046812d4c51fef364a:  # of scenes = 0,  # of dialogs = 0
[*] ec5123faf9944ed8ef012cce89123db124242b3d:  # of scenes = 0,  # of dialogs = 0
[*] ee7e2ef2ecfa84682214c65ed178f959eaffb8ea:  # of scenes = 0,  # of dialogs = 0
[*] ef722cf82033c8e66197209f06a9cb9754be78d9:  # of scenes = 0,  # of dialogs = 0
[*] ef92e6a5b6fe08813c84e8349ddd2cb2dc842bc7:  # of scenes = 0,  # of dialogs = 0
[*] f130ad4c5c491e444e60dddc228e73a592bf8f18:  # of scenes = 0,  # of dialogs = 0
[*] f225a22410b95923cccefe1a5eb04075c4184376:  # of scenes = 0,  # of dialogs = 0
Segmenting:  94%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▊        | 405/433 [00:04<00:00, 86.16it/s][*] f246970289decf6f7a6bd44088d16b118aeaae8d:  # of scenes = 0,  # of dialogs = 0
[*] f5255dcda3e92492cca0b95687bf01d0908b07b4:  # of scenes = 0,  # of dialogs = 0
[*] f6470b27b43e232e5b4458fb1dd6c194cddb2452:  # of scenes = 0,  # of dialogs = 0
[*] f6de97a4d111d0663b747eb10e123952424786d0:  # of scenes = 0,  # of dialogs = 0
Segmenting:  96%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▍     | 414/433 [00:04<00:00, 86.42it/s][*] f750ac4984453071cb5e82de093018e4d70a4f8d:  # of scenes = 0,  # of dialogs = 0
[*] f75afa70c82c3f894abccad514688a835e45c600:  # of scenes = 0,  # of dialogs = 0
[*] f7bb9eb9306b79cad4b6466f2ac3dcbd0e5fa63a:  # of scenes = 0,  # of dialogs = 0
[*] f7bf427e41af53409d7907160f7908e723b78eb0:  # of scenes = 0,  # of dialogs = 0
[*] f7f0a6294e5fe018d584fe29c7c661fc2bf1f86e:  # of scenes = 0,  # of dialogs = 0
[*] f865713a51422129cd8d15ea2bb1ac324d65afdb:  # of scenes = 0,  # of dialogs = 0
[*] f8b3d0124f396d92b58e396b6ab8e2368360c27e:  # of scenes = 0,  # of dialogs = 0
[*] f914d107471567d657d0ac815863dfd079c198db:  # of scenes = 0,  # of dialogs = 0
Segmenting:  98%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▎  | 424/433 [00:04<00:00, 89.20it/s][*] fdfbfabc1a72a0fb7f31ac7ad9e1ced05c838ef0:  # of scenes = 0,  # of dialogs = 0
[*] fe7390fde95a9ec85a35e2de5a869fcc7c7f1a34:  # of scenes = 0,  # of dialogs = 0
[*] fea54b235b1c054d1c90e87f57e3bfb64cbf3a5b:  # of scenes = 0,  # of dialogs = 0
[*] ff53fd53a94f343b8365915645b79d7ad5b1528e:  # of scenes = 0,  # of dialogs = 0
[*] ffae045d630abf7e4c282849d16819ceff60c2b0:  # of scenes = 0,  # of dialogs = 0
[*] ffcf7daee9cda766d2fcf1f6399b29be41876b21:  # of scenes = 0,  # of dialogs = 0
Segmenting: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 433/433 [00:04<00:00, 90.96it/s]
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 116/116 [00:00<00:00, 198.34it/s]
[*] # of training samples:  181831
[*] Saved in <./preprocessed/bad_format_imsdb_self_collected_add_space.pkl>
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 175/175 [00:00<00:00, 41780.69it/s]
[*] # of training samples:  0
[*] Saved in <./preprocessed/by_stats_imsdb_self_collected_add_space.pkl>
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 125/125 [00:00<00:00, 40928.02it/s]
[*] # of training samples:  0
[*] Saved in <./preprocessed/silver_imsdb_self_collected_add_space.pkl>
Traceback (most recent call last):
  File "parse.py", line 765, in <module>
    script_chunks_df = merge_data()
  File "parse.py", line 541, in merge_data
    silver_df['text'] = silver_df_with_space['text'].values     
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/frame.py", line 3163, in __setitem__
    self._set_item(key, value)
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/frame.py", line 3242, in _set_item
    value = self._sanitize_column(key, value)
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/frame.py", line 3899, in _sanitize_column
    value = sanitize_index(value, self.index)
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/internals/construction.py", line 751, in sanitize_index
    raise ValueError(
ValueError: Length of values (0) does not match length of index (455894)

I found where the problem occurs in the code:

...
def merge_data():
    ### BookQA part ###
    scene_df = pickle.load(open('./preprocessed/bookQA_NER_add_space.pkl', "rb"))
    # import storedScript.csv
    movie_name_to_id_mapping = pd.read_csv('./narrative_qa/storedScript.csv')
    scene_df = scene_df.merge(movie_name_to_id_mapping, left_on="book_id", right_on="id", how="left")
    scene_df.drop('id', axis=1, inplace=True)
    scene_df = scene_df.rename(columns={'movieName': 'movie_name'})
    scene_df['source'] = 'old'

    ### silver part ###
    silver_df = pickle.load(open('./preprocessed/NER_silver_imsdb_self_collected_no_lower_preds.pkl', "rb"))
    silver_df_with_space = pd.read_pickle('./preprocessed/silver_imsdb_self_collected_add_space.pkl')  ## <=== This file is empty
    # silver_df['text1'] = silver_df_with_space['text'].values
    # silver_df = silver_df.drop(columns=['text'], axis=1)
    # silver_df = silver_df.rename(columns= {'text1':'text'})
    # TODO: Is the following line equivalent to the above three?
    # TODO: no "text" in silver_df_with_space, only "sentence"
    silver_df['text'] = silver_df_with_space['text'].values       # <== This line raises the error because "silver_df_with_space" is an empty DataFrame
    silver_df['movie_name'] = silver_df['book_id'].str.lower().replace('-', ' ')
    silver_df['source'] = 'new'
    silver_df = silver_df.drop(['label'], axis=1)
    silver_df = silver_df.rename(columns={'label': 'predsWithTitle'})
...

Is there any solution to this error?

Thank you.
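
For anyone hitting the same traceback, here is a minimal sketch of a guard that could be placed in front of the failing assignment so the mismatch is reported before pandas raises. The helper name `assign_text_checked` and the toy DataFrames are hypothetical and not part of parse.py; only the column name `text` is taken from the snippet above.

```python
import pandas as pd


def assign_text_checked(target_df, source_df, column="text"):
    """Copy `column` from source_df into a copy of target_df.

    Hypothetical helper (not in parse.py): it fails early with a
    descriptive message when the column is missing or the row counts differ.
    """
    if column not in source_df.columns:
        raise KeyError(
            f"'{column}' missing from source; columns are {list(source_df.columns)}"
        )
    if len(source_df) != len(target_df):
        raise ValueError(
            f"row count mismatch: source has {len(source_df)}, "
            f"target has {len(target_df)}"
        )
    result = target_df.copy()
    # .values drops the source index, so rows are matched by position.
    result[column] = source_df[column].values
    return result


if __name__ == "__main__":
    # Toy stand-ins for silver_df and the empty silver_df_with_space.
    silver_df = pd.DataFrame({"book_id": ["a", "b"]})
    empty_df = pd.DataFrame({"text": []})
    try:
        assign_text_checked(silver_df, empty_df)
    except ValueError as err:
        print(err)
```

This only makes the failure easier to read; it does not fix the cause. In the log above, every document reports `# of scenes = 0` and the silver pickle is saved with `# of training samples:  0`, so the empty DataFrame most likely comes from the segmentation step rather than from `merge_data()` itself.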
