Coder Social home page Coder Social logo

story2personality's Introduction

Story2Personality

The dataset is a new narrative understanding benchmark to predict personality according to the character’s narrative texts in the script. We release the dataset and the codes for our work accepted to NAACL Student Research Workshop 2022: Machine Narrative Comprehension in Fictional Characters Personality Prediction Task and EMNLP 2022 MBTI Personality Prediction for Fictional Characters Using Movie Scripts.

Step 0: Env Setup

conda env create -f person_environment.yml python=3.8 pandas=1.5.2
conda activate person
python -m spacy download en

Step 1: Data Parsing

Our data parser first reads the narrative books and movie scripts from HTML files, and then extracts utterances said by recognized characters. The whole process can take 3~5 hours to finish. If you are only interested in the data, you can download them via this link and unzip to the root folder.

# move the downloaded "dialog_scene_mention_dicts.zip" to the root folder
unzip dialog_scene_mention_dicts.zip

If you would like to know how the raw text data is processed, you will have to download the HTML files first from OneDrive. The contents are the union of NarrativeQA dataset and Movie-Script-Database. Please unzip the downloaded file to the root repo folder.

# move the downloaded "raw_texts.zip" to the root folder
unzip raw_texts.zip

We are also sharing some other preprocessed files in the preprocessed/ folder which are also the dependencies of our parser. The following command would generate dialog_dict.pickle, scene_dict.pickle, and mention_dict.pickle from scratch.

python parse.py

Hereto, you will get three .pickle files which contain dictionaries of "what people say" and "who are mentioned" in a dialogue or a scene.

Step 2: Model Training and Inferencing

To use the data for modeling, please go to dataset/ and download one of the tokenized datasets. The format is more readily for training and testing than those .pickle files. More details will be provided in the future.

Citation

If you find this repo useful, please consider citing our paper:

@article{sang2022mbti,
  title={MBTI Personality Prediction for Fictional Characters Using Movie Scripts},
  author={Sang, Yisi and Mou, Xiangyang and Yu, Mo and Wang, Dakuo and Li, Jing and Stanton, Jeffrey},
  journal={arXiv preprint arXiv:2210.10994},
  year={2022}
}

story2personality's People

Contributors

moutaigua8183 avatar

Stargazers

LazyPlayer avatar Cheng Li @ SenseTime avatar Yahui Fu avatar Davide avatar Aria F avatar Eunchan Lee avatar ZanD avatar Bingsheng Yao avatar Kevin Saltarelli avatar Rui Ribeiro avatar 陈越 (Chen Yue) avatar Kyumin Lee avatar Praveen Sridhar avatar  avatar Dakuo Wang avatar Dawei Li avatar

Watchers

 avatar  avatar

story2personality's Issues

parse.py running problem

Hello, I have interest your dataset. So I try your code but I meet some error as below.

(base) byeongjuncho@2080C:/data5/byeongjuncho2/Personality/Story2Personality$ python parse.py 
[*] Columns in documents.csv: ['document_id', 'set', 'kind', 'story_url', 'story_file_size', 'wiki_url', 'wiki_title', 'story_word_count', 'story_start', 'story_end']
Analyze segmentation signals: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████| 433/433 [00:43<00:00,  9.96it/s]
[*] Segmentation Analysis Results
  - Success: 300
  - Fail: 121
  - # of movies: 433
Segmenting:   0%|                                                                                                                                         | 0/433 [00:00<?, ?it/s][*] 00ee9e01a0e581e0d8cbf7e865a895147c480c5e:  # of scenes = 0,  # of dialogs = 0
[*] 00f9dbb0a851bc6099d5216e5fa8719b2ac3b82b:  # of scenes = 0,  # of dialogs = 0
[*] 020773a0ca71155173ec4affe6a2496a6cb45216:  # of scenes = 0,  # of dialogs = 0
[*] 032bcfd170a98fb5ed752c1d25b678b8589de7d7:  # of scenes = 0,  # of dialogs = 0
[*] 0434145d9284423a36a714fc55246ed0bdc39a82:  # of scenes = 0,  # of dialogs = 0
Segmenting:   2%|██▉                                                                                                                             | 10/433 [00:00<00:04, 92.47it/s][*] 04dc9134485dee27252734f24c2a3b3dd397b8a4:  # of scenes = 0,  # of dialogs = 0
[*] 050f88f6c8fed44ecbd7658b0f450fb705ce368d:  # of scenes = 0,  # of dialogs = 0
[*] 08373775c3d82d8bbf4ff286a3a0076a3627c652:  # of scenes = 0,  # of dialogs = 0
[*] 087c29795e06ed5b8e29361606c12b6390db9270:  # of scenes = 0,  # of dialogs = 0
[*] 0a93e857113efca05c6274e7af3ba4f03a023b9f:  # of scenes = 0,  # of dialogs = 0
[*] 0b9f563bad33316d94f4f339fb151a4b33f25c3f:  # of scenes = 0,  # of dialogs = 0
Segmenting:   5%|██████▏                                                                                                                        | 21/433 [00:00<00:04, 100.67it/s][*] 0bd270c8c46f84a4abd99d65e2f17a9e11a7f76d:  # of scenes = 0,  # of dialogs = 0
[*] 0c112bc4c07ded9222ae308d1f20591b76aed4de:  # of scenes = 0,  # of dialogs = 0
[*] 0cb3433c5ac030dba47414e2655c3c49e4f37527:  # of scenes = 0,  # of dialogs = 0
[*] 0eae0c4823bd37d58a7b23798c37fd0818bad032:  # of scenes = 0,  # of dialogs = 0
[*] 0edab258db9c1e0483eeaa442e0522bdefa9d445:  # of scenes = 0,  # of dialogs = 0
[*] 0fe1c42d024d1d3f001a9aadb400512ecbda2b9f:  # of scenes = 0,  # of dialogs = 0
[*] 0ffdd0e4abb67f0012a68d719a2d509ee8ec643d:  # of scenes = 0,  # of dialogs = 0
[*] 117ba539f2d311e3067064dca3b5fd68a7b95ee3:  # of scenes = 0,  # of dialogs = 0
Segmenting:   7%|█████████▍                                                                                                                      | 32/433 [00:00<00:04, 94.57it/s][*] 121c28edd4b0474f2386d9290dcee15b5352e006:  # of scenes = 0,  # of dialogs = 0
[*] 139004cd2411c6909a752ecf897d326ed8bf419c:  # of scenes = 0,  # of dialogs = 0
[*] 141a5885bb6d8df56dadbd617b5dec7db102701d:  # of scenes = 0,  # of dialogs = 0
[*] 145200abf14baeffa646797dfbfa58861cb4b079:  # of scenes = 0,  # of dialogs = 0
[*] 14add440692bd1e8fbb8c2df367ba4cb27426383:  # of scenes = 0,  # of dialogs = 0
[*] 150683ab8156bdc78ebd23a2ed7f7e265b780bd0:  # of scenes = 0,  # of dialogs = 0
[*] 1508452704b5829941b98aeee2eeba192f5beae5:  # of scenes = 0,  # of dialogs = 0
[*] 1625d7038823e0ca81c39675c3ea7fe31f7fbe04:  # of scenes = 0,  # of dialogs = 0
[*] 16a6a06abf4ff93fe3143b66c45dbeae6550816b:  # of scenes = 0,  # of dialogs = 0
Segmenting:  10%|████████████▍                                                                                                                   | 42/433 [00:00<00:04, 86.93it/s][*] 173b52e21786674aef19e640355e4a6fc869b34e:  # of scenes = 0,  # of dialogs = 0
[*] 18bcdf16fb896faf7fb36db6b752f16f0343ad24:  # of scenes = 0,  # of dialogs = 0
[*] 197cf28c98cf416fed651b7c569073249b37f855:  # of scenes = 0,  # of dialogs = 0
[*] 1995b44b685baf07ae17d56bb810ab8295785010:  # of scenes = 0,  # of dialogs = 0
[*] 199ec80c97f752cd8e00d2ca6f8e5f48603fefb8:  # of scenes = 0,  # of dialogs = 0
[*] 19dd268e42d372d73072c40bc3b8d1c46e07a3a2:  # of scenes = 0,  # of dialogs = 0
[*] 1a020bc35fe8ff93de0a570f7cf70e958880241c:  # of scenes = 0,  # of dialogs = 0
[*] 1a447099edd43fe60f1bf8f2b1eabbd11a5c3989:  # of scenes = 0,  # of dialogs = 0
[*] 1a7119c0cbb4a82b93913600187c98fc9dfa52b0:  # of scenes = 0,  # of dialogs = 0
Segmenting:  12%|███████████████▎                                                                                                                | 52/433 [00:00<00:04, 88.01it/s][*] 1ae015eaa8cd3b66136578e947afc72ed338fe10:  # of scenes = 0,  # of dialogs = 0
[*] 1b48982220584874f6abed587219f6d73e85abd8:  # of scenes = 0,  # of dialogs = 0
[*] 1b8ca5c74e79ccf56eee42155d467c89c8445019:  # of scenes = 0,  # of dialogs = 0
[*] 1da161b6007d46d85eda16ae391e1e78218fabd2:  # of scenes = 0,  # of dialogs = 0
[*] 1ddea1c3f13183fbe029834b5a3ab505600c5dfa:  # of scenes = 0,  # of dialogs = 0
[*] 1f8e03c7b6a6864108933fba1906455f78e4cfa6:  # of scenes = 0,  # of dialogs = 0
[*] 21140bb7b7f2e0c9ccadf799b60b577685c42647:  # of scenes = 0,  # of dialogs = 0
[*] 2114ee4681976de8a9d75d0e411ccc47bfa1caa3:  # of scenes = 0,  # of dialogs = 0
Segmenting:  14%|██████████████████                                                                                                              | 61/433 [00:00<00:04, 87.76it/s][*] 214cb5277750c1ccddec0b10fa545c3c78c45f64:  # of scenes = 0,  # of dialogs = 0
[*] 21813f2122fca16d05e89d44f4521f7da8a8f3b7:  # of scenes = 0,  # of dialogs = 0
[*] 2453d062843edc379bdae3be69859e18bf1abd9d:  # of scenes = 0,  # of dialogs = 0
[*] 266f5b2295980ad31b5090a6c51b69055c87b3a7:  # of scenes = 0,  # of dialogs = 0
[*] 26b13fb4ac397ec6d550e209c9979116de71f467:  # of scenes = 0,  # of dialogs = 0
[*] 26da72cfd563728fc90c2a808a9c6f076fa4e815:  # of scenes = 0,  # of dialogs = 0
Segmenting:  16%|████████████████████▋                                                                                                           | 70/433 [00:00<00:04, 82.10it/s][*] 27ad3a0a84489109ca5d3cc4bf3b8b74a7b8ed9a:  # of scenes = 0,  # of dialogs = 0
[*] 2ba38d31381278d2ae8e90a2a3108debaef877bb:  # of scenes = 0,  # of dialogs = 0
[*] 2c7a9ffe4c06039bd8fee47b6235443267863686:  # of scenes = 0,  # of dialogs = 0
[*] 2e15003e58afe663f66622edbfc3d7d1c7c69a60:  # of scenes = 0,  # of dialogs = 0
[*] 2e24e035fddb9b96430c9bb2cf45d44facac6fc8:  # of scenes = 0,  # of dialogs = 0
[*] 2fc2cfddbe1ca72a7c2ae750a6c8b656dad60f9b:  # of scenes = 0,  # of dialogs = 0
Segmenting:  19%|███████████████████████▉                                                                                                        | 81/433 [00:00<00:03, 88.45it/s][*] 2fe5752889751cbb4bbeab3deba4f6e315f90f0a:  # of scenes = 0,  # of dialogs = 0
[*] 307f5583cf63457100b2d3f616d669e03a501196:  # of scenes = 0,  # of dialogs = 0
[*] 317fe46b2903da739b059733cd618db6ca49a494:  # of scenes = 0,  # of dialogs = 0
[*] 324138e357569216a5fae897cea8cba4548626c4:  # of scenes = 0,  # of dialogs = 0
[*] 33852b452ef4d43a9aa4afbeb543b6a3107e7209:  # of scenes = 0,  # of dialogs = 0
[*] 341bfd64f674b6410af0efd4fe9ab0c5c728d82d:  # of scenes = 0,  # of dialogs = 0
[*] 35891de62bab83d5b312ddeb835c7e0b245e3282:  # of scenes = 0,  # of dialogs = 0
[*] 35ae42f3419e73b7c7357c222e494eaa4161026b:  # of scenes = 0,  # of dialogs = 0
[*] 361364f57460139410c4130a1e7a58caf152c2bd:  # of scenes = 0,  # of dialogs = 0
Segmenting:  21%|███████████████████████████▍                                                                                                    | 93/433 [00:01<00:03, 95.72it/s][*] 3634471ed994ee7d4f382d8e7edbc56de5c28c42:  # of scenes = 0,  # of dialogs = 0
[*] 375d6609a90580a1ce888d45ac697464e6870010:  # of scenes = 0,  # of dialogs = 0
[*] 37b90142ab00baef4ece1ee4896b8933cd8cdb61:  # of scenes = 0,  # of dialogs = 0
[*] 386e3ca25d1aabc56f5a7eaf9714badb8ec86382:  # of scenes = 0,  # of dialogs = 0
[*] 38e24416d39a0a285ef1693adad25c9ed0c94487:  # of scenes = 0,  # of dialogs = 0
[*] 399f3571af53d150a999da5553de856ddb0815b9:  # of scenes = 0,  # of dialogs = 0
[*] 39c9fc154b2cc4030d2732fab674fc246eb28aed:  # of scenes = 0,  # of dialogs = 0
Segmenting:  24%|██████████████████████████████▏                                                                                                | 103/433 [00:01<00:03, 88.35it/s][*] 3a82541bb17890577626925533f6549efe5cf08e:  # of scenes = 0,  # of dialogs = 0
[*] 3b2427be8d3bbfe55590d1ce307f778ede5bbcd9:  # of scenes = 0,  # of dialogs = 0
[*] 3c78ea07497de438f4fcddc0d49051a7a6fa5490:  # of scenes = 0,  # of dialogs = 0
[*] 3d248aa8bba34b3f5199c1aed1443b9fa3395d03:  # of scenes = 0,  # of dialogs = 0
[*] 3e87eee49ffe1cbcc871f2310489775ffcaccbb2:  # of scenes = 0,  # of dialogs = 0
[*] 3feb46d105b7ef3bdb248064761dc309c1831466:  # of scenes = 0,  # of dialogs = 0
[*] 403479540337f035a011f0ea84b4770e31e18207:  # of scenes = 0,  # of dialogs = 0
Segmenting:  26%|████████████████████████████████▊                                                                                              | 112/433 [00:01<00:03, 85.00it/s][*] 407624e45c153cdd848f539d37ca8b5d448f9580:  # of scenes = 0,  # of dialogs = 0
[*] 40ae9851099e8fc4d00c6d3fed1549ae50a75bd0:  # of scenes = 0,  # of dialogs = 0
[*] 411d53d3c5c42990aebb7cb0bf4964f8d0a6f0bc:  # of scenes = 0,  # of dialogs = 0
[*] 42d253275a8807aa6ecf57c6c306cb24d76710f1:  # of scenes = 0,  # of dialogs = 0
[*] 43059307f3d694292ba9178da8fc3e1fe470531f:  # of scenes = 0,  # of dialogs = 0
[*] 43c4885ee70fb0f0871001cbe555520bdf2c3375:  # of scenes = 0,  # of dialogs = 0
[*] 43fce60036da73632775421436a2c2a1f970c755:  # of scenes = 0,  # of dialogs = 0
[*] 447d97a7439de3811d9b6f4dfd5685e09f5fb727:  # of scenes = 0,  # of dialogs = 0
[*] 44cd7b845627af65bf84e6ce3bfc8cfab5bf7b7c:  # of scenes = 0,  # of dialogs = 0
[*] 45c9fe33a1c1348c3ba212834bd2807100c6721a:  # of scenes = 0,  # of dialogs = 0
Segmenting:  28%|███████████████████████████████████▊                                                                                           | 122/433 [00:01<00:03, 88.10it/s][*] 46381add305c73e6d4625548324615b11dfb25c8:  # of scenes = 0,  # of dialogs = 0
[*] 46a5ee8bbf57f56ad5472e0712e98370b734145d:  # of scenes = 0,  # of dialogs = 0
[*] 46c83ab910bbf794c07735c7d55af442a54d090b:  # of scenes = 0,  # of dialogs = 0
[*] 48266045f2dbe4de0cea552d3ec8ffb541c5e182:  # of scenes = 0,  # of dialogs = 0
[*] 492f4b276ebddc0e391f1ce39201849e38a2df20:  # of scenes = 0,  # of dialogs = 0
[*] 49981df2afcb8a656e10eb87b3acd859783eb046:  # of scenes = 0,  # of dialogs = 0
[*] 4a1fff119d01e5ede3da8b3c13c3925484720533:  # of scenes = 0,  # of dialogs = 0
Segmenting:  30%|██████████████████████████████████████▋                                                                                        | 132/433 [00:01<00:03, 90.09it/s][*] 4dcfd38222f416db7aa1add3ecd85e2c9d6160d1:  # of scenes = 0,  # of dialogs = 0
[*] 4f17e590323c45bf12f789e5990d6ab90698671a:  # of scenes = 0,  # of dialogs = 0
[*] 4f485054f9d450534fddba184f0996e32575d1be:  # of scenes = 0,  # of dialogs = 0
[*] 5041b6dbfc48abb92a0c118fcb358a3da92bef34:  # of scenes = 0,  # of dialogs = 0
[*] 5170566c90131e41863b393cc12d1df9f9c17cc4:  # of scenes = 0,  # of dialogs = 0
[*] 51f52296a99ce50b6ca69aa5939d81a5aea4e042:  # of scenes = 0,  # of dialogs = 0
[*] 523a2eb1ae686d7bf0e664c89d0a490a7e1a22bc:  # of scenes = 0,  # of dialogs = 0
[*] 5339e9db4aca0b74b644e736274989344864f0ba:  # of scenes = 0,  # of dialogs = 0
Segmenting:  33%|██████████████████████████████████████████▌                                                                                    | 145/433 [00:01<00:02, 99.48it/s][*] 55d1940b8c8e1c73e175bac2fce0a4f9844fff02:  # of scenes = 0,  # of dialogs = 0
[*] 567c1a39aff9590875a843a9df35f8c5b880ae27:  # of scenes = 0,  # of dialogs = 0
[*] 5685121eb6af89082a190b3383cdee15a2ae83ff:  # of scenes = 0,  # of dialogs = 0
[*] 570c8d99794d019e34b3ead4b4fcf80bb0fac459:  # of scenes = 0,  # of dialogs = 0
[*] 5806789ebca66b68e86317d0e09b8a433c236280:  # of scenes = 0,  # of dialogs = 0
[*] 5ad9844f125d7051ba23edbb8b48a74f4f6102c8:  # of scenes = 0,  # of dialogs = 0
[*] 5c1d04428ffd3f0ddd732a723e61371e0255aa49:  # of scenes = 0,  # of dialogs = 0
Segmenting:  36%|█████████████████████████████████████████████▊                                                                                 | 156/433 [00:01<00:02, 96.32it/s][*] 5e68bb2ae2af335f5f828966209fb6e97621005f:  # of scenes = 0,  # of dialogs = 0
[*] 5ea4ebbc0d86c3f629932a6f2470949776e58579:  # of scenes = 0,  # of dialogs = 0
[*] 5f4f9df9f707d382b33c73db83b316a60599a6ea:  # of scenes = 0,  # of dialogs = 0
[*] 5fd696aa16bb2f03de7af3be71ca9047d3a82935:  # of scenes = 0,  # of dialogs = 0
[*] 6031da6fa93ad9cac1b6da6586010aab81c7b4da:  # of scenes = 0,  # of dialogs = 0
[*] 60ea304651d0be17e4d8e51fa1cb896437021d6a:  # of scenes = 0,  # of dialogs = 0
Segmenting:  38%|████████████████████████████████████████████████▋                                                                              | 166/433 [00:01<00:02, 91.46it/s][*] 634abf83f11ee4449864e1c391b0456b3fdbdf64:  # of scenes = 0,  # of dialogs = 0
[*] 64e583bde7ea2b98c40e03756a32ce31be036af2:  # of scenes = 0,  # of dialogs = 0
[*] 65fdaaca567f74e28159d1c6a7b6cfeec34316ff:  # of scenes = 0,  # of dialogs = 0
[*] 677cda46d079c6df650914eda8d9da1dcda8bf8d:  # of scenes = 0,  # of dialogs = 0
[*] 67ca24897196449395daa9886c7fbaceab55c964:  # of scenes = 0,  # of dialogs = 0
[*] 681b019e94056cbe4f7a13c8323eae0a65ebdbec:  # of scenes = 0,  # of dialogs = 0
[*] 683105161212ee6e3f95eb2857623ad77b088af9:  # of scenes = 0,  # of dialogs = 0
[*] 68ee401e0c66832834f605b625d5062b06a59515:  # of scenes = 0,  # of dialogs = 0
Segmenting:  41%|███████████████████████████████████████████████████▌                                                                           | 176/433 [00:01<00:02, 90.49it/s][*] 69099d7d543fad22f61d8acf97c681c9c86cac0e:  # of scenes = 0,  # of dialogs = 0
[*] 6a02d46e87865ba5b033c56c658af2bfdd182093:  # of scenes = 0,  # of dialogs = 0
[*] 6a14d1ba1aa1a6719840cb98a8feb6a711291aa4:  # of scenes = 0,  # of dialogs = 0
[*] 6a2f59a81c3730b5e1d5f685bac8bbbe547b7b3b:  # of scenes = 0,  # of dialogs = 0
[*] 6aa4c6be53eff0024e5c84f99ac94cddff4eb8f0:  # of scenes = 0,  # of dialogs = 0
[*] 6bec7c2bdd0b01296bc9020288d833709e54cd52:  # of scenes = 0,  # of dialogs = 0
[*] 6c2c91cc1e5aa0a597e6c5c0d7029e8672c0f60d:  # of scenes = 0,  # of dialogs = 0
[*] 6d2e1b95ed2a00e16e046b6f3d0b03e687c7f7f7:  # of scenes = 0,  # of dialogs = 0
Segmenting:  43%|██████████████████████████████████████████████████████▌                                                                        | 186/433 [00:02<00:02, 85.65it/s][*] 6d925e92824fabd7513cd864687c29d6ee3e5c2d:  # of scenes = 0,  # of dialogs = 0
[*] 6ed5d860df4edd3d32f1aca52ec369f0d21039a0:  # of scenes = 0,  # of dialogs = 0
[*] 6ffb5d981d101fbbd43d97bc45720388570f2c61:  # of scenes = 0,  # of dialogs = 0
[*] 70794150f324949ca49f182db0d3f8d69d0c779e:  # of scenes = 0,  # of dialogs = 0
[*] 70b3f55e376d461b1cb7dc5005c02091478e2e44:  # of scenes = 0,  # of dialogs = 0
[*] 71ce19cf034c830c1e2d8b98682ca1d53ead1067:  # of scenes = 0,  # of dialogs = 0
[*] 7235e7853e8ea8ba99b6d7a386d8de03b3887ace:  # of scenes = 0,  # of dialogs = 0
Segmenting:  45%|█████████████████████████████████████████████████████████▊                                                                     | 197/433 [00:02<00:02, 89.93it/s][*] 7415641a4aa3cc0b71657573197bfc9d48694e03:  # of scenes = 0,  # of dialogs = 0
[*] 78e0e28b686c7157d598082bfaa8aaaab821b78b:  # of scenes = 0,  # of dialogs = 0
[*] 7b85063db83a2ba9cba23510c61d769626b604a3:  # of scenes = 0,  # of dialogs = 0
[*] 7c41a74fb0e568bd5fc658282facbf79bf541271:  # of scenes = 0,  # of dialogs = 0
[*] 7c531dc41880b96b8acfaf66a39a85b91d68b636:  # of scenes = 0,  # of dialogs = 0
[*] 7cce97e00d2de0f9c3a19e5ac5c05f6449a77642:  # of scenes = 0,  # of dialogs = 0
Segmenting:  48%|████████████████████████████████████████████████████████████▋                                                                  | 207/433 [00:02<00:02, 86.59it/s][*] 7ef394b11230baf81f61e790839fa993c6ea1f72:  # of scenes = 0,  # of dialogs = 0
[*] 7f1cb0e615795ed6be5d96ea3f13ce62921d8835:  # of scenes = 0,  # of dialogs = 0
[*] 7f4bce4058ac3cf2ff5cfb36908c8e6d891797e1:  # of scenes = 0,  # of dialogs = 0
[*] 81d2ece8e55ac0a2799aa87a43117f01bbd7506d:  # of scenes = 0,  # of dialogs = 0
[*] 81db4f3e5fe29c02fd7b7a702aa4847db6a04613:  # of scenes = 0,  # of dialogs = 0
Segmenting:  50%|███████████████████████████████████████████████████████████████▋                                                               | 217/433 [00:02<00:02, 88.76it/s][*] 83a1fd492021ceb110451d65888072daf64a5d4f:  # of scenes = 0,  # of dialogs = 0
[*] 84327ef84b778b11993de1d2e3f6fb04eeb09fff:  # of scenes = 0,  # of dialogs = 0
[*] 84732f85b51dfbfed6c40f2bc1e35e1697eade8e:  # of scenes = 0,  # of dialogs = 0
[*] 8501cab146742babe04bc3984eb34409d97078a1:  # of scenes = 0,  # of dialogs = 0
[*] 854dd2f347a89b5653a9e8372541af6fc3590254:  # of scenes = 0,  # of dialogs = 0
[*] 85591366e6f0ab31815c40d55e1d4d5182d20ec5:  # of scenes = 0,  # of dialogs = 0
Segmenting:  52%|██████████████████████████████████████████████████████████████████▎                                                            | 226/433 [00:02<00:02, 89.07it/s][*] 88545fa9a5ab807238066a7ab8867ac3adbcd03b:  # of scenes = 0,  # of dialogs = 0
[*] 88cce939b62c833842ccfc1e0fa7534288626c86:  # of scenes = 0,  # of dialogs = 0
[*] 8928a603a3c6d154ea4547060805a966ce0f4d60:  # of scenes = 0,  # of dialogs = 0
[*] 8b1ac9ad821c24ab658e4977aa169ff195f4967f:  # of scenes = 0,  # of dialogs = 0
[*] 8b9da16420edd653ae5e0e2925dd3cade3a21ebc:  # of scenes = 0,  # of dialogs = 0
[*] 8cea29a5fd9e324e9ff07ab2e4a1521d591cb318:  # of scenes = 0,  # of dialogs = 0
[*] 8dddfb1c4aa33c5821670ba20549ec02aba73056:  # of scenes = 0,  # of dialogs = 0
Segmenting:  55%|█████████████████████████████████████████████████████████████████████▌                                                         | 237/433 [00:02<00:02, 93.57it/s][*] 91075ab5383e8113d86b17536c6918bfbbd6af20:  # of scenes = 0,  # of dialogs = 0
[*] 91293cd81f021de45cffd363ef81dd95d2c122a3:  # of scenes = 0,  # of dialogs = 0
[*] 916835cb4bcb3baa6333e7cca25bef7710dbdcbc:  # of scenes = 0,  # of dialogs = 0
[*] 92134f0c9dc82e7b2cc9afd5896ae8dc7d6d088e:  # of scenes = 0,  # of dialogs = 0
[*] 9292406d5193d3402195df7e5647fd168da2d15a:  # of scenes = 0,  # of dialogs = 0
[*] 93116caf60209c52bca7cd0b51ccc366eb90f6c6:  # of scenes = 0,  # of dialogs = 0
[*] 94f1c8eb8ce7f271eb52c6ae9071ae1b56dabfcb:  # of scenes = 0,  # of dialogs = 0
Segmenting:  58%|████████████████████████████████████████████████████████████████████████▋                                                     | 250/433 [00:02<00:01, 102.04it/s][*] 95315365bf6b07073562333fa15403f68e040e70:  # of scenes = 0,  # of dialogs = 0
[*] 953a454c146650e09d27fdb4dbe3cfdc6a9ca697:  # of scenes = 0,  # of dialogs = 0
[*] 987d3e31dc8b6029a2b9450243e122c8862bbc24:  # of scenes = 0,  # of dialogs = 0
[*] 9990281afdfbf883d53ac7540acfcbb375380f75:  # of scenes = 0,  # of dialogs = 0
[*] 999a532b45f030c8f382f0f92fc51b3d12fd821c:  # of scenes = 0,  # of dialogs = 0
[*] 9a1f8183b31ba8ab9f1719fa42b36a173035beb4:  # of scenes = 0,  # of dialogs = 0
[*] 9c2bb97cbbb8dca3fb1d85fbe1abeb27ad046615:  # of scenes = 0,  # of dialogs = 0
Segmenting:  60%|████████████████████████████████████████████████████████████████████████████▌                                                  | 261/433 [00:02<00:01, 90.42it/s][*] 9cbe9d08ff6673e8dba308ac11ba88d71b425209:  # of scenes = 0,  # of dialogs = 0
[*] 9cd25b973d253386eaebbb7f2f7821dc7518f6d6:  # of scenes = 0,  # of dialogs = 0
[*] 9d8b79107628c4dedae91fa3648e56ae94b997fc:  # of scenes = 0,  # of dialogs = 0
[*] 9d8ddfe86beba149b3463eeb1bae92919e179fed:  # of scenes = 0,  # of dialogs = 0
[*] 9e1475e4ee95eb7b4dca656e5afde2aaf5fdda8a:  # of scenes = 0,  # of dialogs = 0
[*] 9ebb84bdc9cc6d698ccc331437bd1ec3b5f0dddb:  # of scenes = 0,  # of dialogs = 0
[*] 9f83c8e49f5a53b211caf37cbdc659f97d2ef30a:  # of scenes = 0,  # of dialogs = 0
Segmenting:  63%|███████████████████████████████████████████████████████████████████████████████▊                                               | 272/433 [00:02<00:01, 93.21it/s][*] a13cfb713f7eca7e750b8ad20b946b142aaa5dbf:  # of scenes = 0,  # of dialogs = 0
[*] a18e921ea3e947642147754dffa769f9eabb31e7:  # of scenes = 0,  # of dialogs = 0
[*] a32f354788dd1f1411b9745200cb330d9b556373:  # of scenes = 0,  # of dialogs = 0
[*] a34a30243522a835f826fe5269b4d234c5443ff3:  # of scenes = 0,  # of dialogs = 0
[*] a3d22e30a6afde892a65e16db0454093a232da87:  # of scenes = 0,  # of dialogs = 0
[*] a3f5043d31f3d18b625f75f69392834d4479df38:  # of scenes = 0,  # of dialogs = 0
Segmenting:  65%|██████████████████████████████████████████████████████████████████████████████████▋                                            | 282/433 [00:03<00:01, 90.78it/s][*] a493bc348052f6962bb423ad44b2e13134d635fc:  # of scenes = 0,  # of dialogs = 0
[*] a65c4ede45179964de0abc38a88aecd767eaf505:  # of scenes = 0,  # of dialogs = 0
[*] a69fee0515cdb067fec5c42fa88ace5c9639118c:  # of scenes = 0,  # of dialogs = 0
[*] a8549480950ac906c9426b7d8cb7963e52e4cd6c:  # of scenes = 0,  # of dialogs = 0
[*] a8771e0706c5b98b55c60b8dc3b9668295315712:  # of scenes = 0,  # of dialogs = 0
[*] a87d47ee243b5cc4f8ce3acf483ccf9e77083e16:  # of scenes = 0,  # of dialogs = 0
Segmenting:  67%|█████████████████████████████████████████████████████████████████████████████████████▋                                         | 292/433 [00:03<00:01, 86.28it/s][*] a912a7308035d8c7136ca7439bce977ed57951f4:  # of scenes = 0,  # of dialogs = 0
[*] acdeff3068372e962ae2600b4883e5cd62c04150:  # of scenes = 0,  # of dialogs = 0
[*] ad0ec287e1d4d99b7ed905a7765e56f79eecb9ef:  # of scenes = 0,  # of dialogs = 0
[*] ad63a9aa1afa22bf51451da2e2f45d9543c9ca62:  # of scenes = 0,  # of dialogs = 0
[*] add8e2efd820ef2778c746e2d3d6c0cd8c650672:  # of scenes = 0,  # of dialogs = 0
[*] aecd2050091b2c8f57190bc9a1c0a1d81d6b2f56:  # of scenes = 0,  # of dialogs = 0
[*] b106547216886fe269040370774d80b5a5a53318:  # of scenes = 0,  # of dialogs = 0
[*] b1b67dc8de94763e66312262449f6cd55956f1f0:  # of scenes = 0,  # of dialogs = 0
Segmenting:  70%|████████████████████████████████████████████████████████████████████████████████████████▌                                      | 302/433 [00:03<00:01, 87.40it/s][*] b236a647b02cd1b003edbf96ab501e8d6aea6c6d:  # of scenes = 0,  # of dialogs = 0
[*] b31c8b60d28467403c1615ec883b167a0c103835:  # of scenes = 0,  # of dialogs = 0
[*] b4010f552ee32bf8b2f4dca248f176b0bf2602b1:  # of scenes = 0,  # of dialogs = 0
[*] b5038cd75a0f275ec87cd993eba3c2af3731bc6c:  # of scenes = 0,  # of dialogs = 0
[*] b50c5113b46d135015347f095947181fbe5fe30f:  # of scenes = 0,  # of dialogs = 0
[*] b598355c6a88e44b884cac7c54b9c955af552e7e:  # of scenes = 0,  # of dialogs = 0
[*] b61a35a3025b1b738c257af2a2b5af9a52222304:  # of scenes = 0,  # of dialogs = 0
[*] b6ed74a969cdbb5ddc34575df8b7ee9d955d5556:  # of scenes = 0,  # of dialogs = 0
Segmenting:  72%|███████████████████████████████████████████████████████████████████████████████████████████▊                                   | 313/433 [00:03<00:01, 91.20it/s][*] b833410a8ff29952ca664319cb3462f2fd07d4f9:  # of scenes = 0,  # of dialogs = 0
[*] b961b2d182f89c9c4259ffd8001df871ead32ef0:  # of scenes = 0,  # of dialogs = 0
[*] ba74352102b49e66de4485552e80012fe38adb20:  # of scenes = 0,  # of dialogs = 0
[*] bd14fef15878fdac1e9c2d2dbe52df0951f38aad:  # of scenes = 0,  # of dialogs = 0
[*] bddb5d539d43bd3d63ca9ff7266e424eb5854521:  # of scenes = 0,  # of dialogs = 0
[*] bde72e62aabedf831fc81b5847152a244fd97bee:  # of scenes = 0,  # of dialogs = 0
[*] bf2e4b5bd8d06325ffe3fcee4af99b9d95ee34e4:  # of scenes = 0,  # of dialogs = 0
[*] bf438dd002b209fa4550cd56752c6549428fc4bc:  # of scenes = 0,  # of dialogs = 0
[*] bfd50b1cf73709308ab4ad727d829c5fca23480c:  # of scenes = 0,  # of dialogs = 0
[*] c002791c4ff779710ca7ba8d8cde2ac4b27d28b3:  # of scenes = 0,  # of dialogs = 0
Segmenting:  75%|██████████████████████████████████████████████████████████████████████████████████████████████▋                                | 323/433 [00:03<00:01, 91.00it/s][*] c0e05b3aac173e39064f07ccb60fb0a30430f824:  # of scenes = 0,  # of dialogs = 0
[*] c2e872845e99f82ddfa5a3ed84f985a04b6729e6:  # of scenes = 0,  # of dialogs = 0
[*] c3d05fedec86ea11bb70837d54b941589bde4d88:  # of scenes = 0,  # of dialogs = 0
[*] c5384ac7f6a3e69a17ede247235936b934a71a03:  # of scenes = 0,  # of dialogs = 0
[*] c551edfbb8240501afd55b17495674b9a04060e3:  # of scenes = 0,  # of dialogs = 0
Segmenting:  77%|█████████████████████████████████████████████████████████████████████████████████████████████████▋                             | 333/433 [00:03<00:01, 93.45it/s][*] c65f47d3de4510d418357b1f133d3171f0bc4eca:  # of scenes = 0,  # of dialogs = 0
[*] c72161c89a7dc8ea5d62b200689bd2acae6f354d:  # of scenes = 0,  # of dialogs = 0
[*] c7c075c49018828bf6027da5c5534834779d1adf:  # of scenes = 0,  # of dialogs = 0
[*] c96a12df9414dca862b7f1d5882dadb40e152121:  # of scenes = 0,  # of dialogs = 0
[*] cb456274929bf53ca118acacea7175e9f25c99bd:  # of scenes = 0,  # of dialogs = 0
[*] cbb15e9755f01a017965f239be1cd3b9277f69ed:  # of scenes = 0,  # of dialogs = 0
[*] cc49ee763ad73ea914a925f7dddf3687b77a69c9:  # of scenes = 0,  # of dialogs = 0
[*] ccde92fd5de1e67f17e10d9e0cd3375ce4efcf23:  # of scenes = 0,  # of dialogs = 0
Segmenting:  79%|████████████████████████████████████████████████████████████████████████████████████████████████████▌                          | 343/433 [00:03<00:00, 92.15it/s][*] ccdf8c8c07e95675fae3591714061ecacfd5ad2e:  # of scenes = 0,  # of dialogs = 0
[*] cd13eb380843224f66c34047cc06dc445a92f8fd:  # of scenes = 0,  # of dialogs = 0
[*] ce23d22d3c3b9297322ea9573f356987ecdeeeb6:  # of scenes = 0,  # of dialogs = 0
[*] ce8cb184a11535e7a7c824c82b7772a1c3a7c92c:  # of scenes = 0,  # of dialogs = 0
[*] cf873fb685ac6b1bd09c733ad9b0c0130d109454:  # of scenes = 0,  # of dialogs = 0
[*] d053fe3c068b8a68d07d7384056afd16935e608a:  # of scenes = 0,  # of dialogs = 0
[*] d079071fecafe63e2939d8e866c36819c21b907c:  # of scenes = 0,  # of dialogs = 0
[*] d094b01390b1598e80f7fd148ef0987e882f04ab:  # of scenes = 0,  # of dialogs = 0
Segmenting:  82%|███████████████████████████████████████████████████████████████████████████████████████████████████████▊                       | 354/433 [00:03<00:00, 95.08it/s][*] d15686cf4482b52351e990ccd991fefda8d2f6dc:  # of scenes = 0,  # of dialogs = 0
[*] d1960641caba4c85c372a2177e6727ad948c7005:  # of scenes = 0,  # of dialogs = 0
[*] d226b0c7fb662f93cc3298e3caa21212f59e0a36:  # of scenes = 0,  # of dialogs = 0
[*] d2e4700544066553a0c434d4861e9b7c4cdfbd7b:  # of scenes = 0,  # of dialogs = 0
[*] d48a25702aca65bfc7755c1dbcb5c196593af1ee:  # of scenes = 0,  # of dialogs = 0
[*] d5b32abd0fe5966b8c619084932c5d832d51f063:  # of scenes = 0,  # of dialogs = 0
[*] d66fe35ce1d4d1166add716e366c04a84618cabe:  # of scenes = 0,  # of dialogs = 0
Segmenting:  84%|██████████████████████████████████████████████████████████████████████████████████████████████████████████▊                    | 364/433 [00:04<00:00, 93.35it/s][*] d6c728cc9fabd2ef68dabc990731470f455e8fac:  # of scenes = 0,  # of dialogs = 0
[*] d75d97726ac6b229b809cb7e482606024c2e564a:  # of scenes = 0,  # of dialogs = 0
[*] d8844d709aa624a5ffe70f185dc68488839d37ea:  # of scenes = 0,  # of dialogs = 0
[*] d8aeeba694332530d1ca1647779c3228959aa20a:  # of scenes = 0,  # of dialogs = 0
[*] d984035756201895c1acd9233775031cc9c0a30c:  # of scenes = 0,  # of dialogs = 0
[*] d9df8732f4fad8d4ffa6d8b2f7af12ea374a5be2:  # of scenes = 0,  # of dialogs = 0
[*] dbb845a9465690011b39ffd4408c5a41db58d97a:  # of scenes = 0,  # of dialogs = 0
[*] dbc088fbc6dd9efb6b6b7d8821f73eb0f1759db4:  # of scenes = 0,  # of dialogs = 0
Segmenting:  87%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████▉                 | 375/433 [00:04<00:00, 95.51it/s][*] dc8d6c5a9a9cb0ee6cc3b47ed9aa7a6f6209d05e:  # of scenes = 0,  # of dialogs = 0
[*] dcd3468c3e18822f08ff0607edfb3e9ca06f3ef0:  # of scenes = 0,  # of dialogs = 0
[*] ddd55023a3dd6800331b50c560f74390f85f1e06:  # of scenes = 0,  # of dialogs = 0
[*] deae2c2c3964684550d73f691762da489f9782f7:  # of scenes = 0,  # of dialogs = 0
[*] dfea98678342e17dbfce44c7906602788cc2267c:  # of scenes = 0,  # of dialogs = 0
[*] e0c74cdf270ebe29a2139e7319fc7314738c88ee:  # of scenes = 0,  # of dialogs = 0
[*] e1c042c57411f230068ededfd7b27e44c0580700:  # of scenes = 0,  # of dialogs = 0
[*] e292eae2486862f6df6ff388cb2dd6777bc73f27:  # of scenes = 0,  # of dialogs = 0
Segmenting:  89%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████▉              | 385/433 [00:04<00:00, 94.18it/s][*] e3793c8072528b1d92b8614113e6d2b5748652cd:  # of scenes = 0,  # of dialogs = 0
[*] e4081796caf2b0354f1fc78626b7a74396979e5b:  # of scenes = 0,  # of dialogs = 0
[*] e60d35663e8d38fa4bde3bff0690ab2ca735fd74:  # of scenes = 0,  # of dialogs = 0
[*] e6ff33cf1eb66a9bcffffbeb0866ec6ccee7f3af:  # of scenes = 0,  # of dialogs = 0
[*] e80dcfbc4d200c173d6ac969a9b160a40a1edf70:  # of scenes = 0,  # of dialogs = 0
[*] ea5d07dd2150a3e4fd5199ab496074839a019ded:  # of scenes = 0,  # of dialogs = 0
[*] ea6f69c29b491c58796029a66f029e552db2819d:  # of scenes = 0,  # of dialogs = 0
Segmenting:  91%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████▊           | 395/433 [00:04<00:00, 90.31it/s][*] ec14a0a6712974227acf09046812d4c51fef364a:  # of scenes = 0,  # of dialogs = 0
[*] ec5123faf9944ed8ef012cce89123db124242b3d:  # of scenes = 0,  # of dialogs = 0
[*] ee7e2ef2ecfa84682214c65ed178f959eaffb8ea:  # of scenes = 0,  # of dialogs = 0
[*] ef722cf82033c8e66197209f06a9cb9754be78d9:  # of scenes = 0,  # of dialogs = 0
[*] ef92e6a5b6fe08813c84e8349ddd2cb2dc842bc7:  # of scenes = 0,  # of dialogs = 0
[*] f130ad4c5c491e444e60dddc228e73a592bf8f18:  # of scenes = 0,  # of dialogs = 0
[*] f225a22410b95923cccefe1a5eb04075c4184376:  # of scenes = 0,  # of dialogs = 0
Segmenting:  94%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▊        | 405/433 [00:04<00:00, 86.16it/s][*] f246970289decf6f7a6bd44088d16b118aeaae8d:  # of scenes = 0,  # of dialogs = 0
[*] f5255dcda3e92492cca0b95687bf01d0908b07b4:  # of scenes = 0,  # of dialogs = 0
[*] f6470b27b43e232e5b4458fb1dd6c194cddb2452:  # of scenes = 0,  # of dialogs = 0
[*] f6de97a4d111d0663b747eb10e123952424786d0:  # of scenes = 0,  # of dialogs = 0
Segmenting:  96%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▍     | 414/433 [00:04<00:00, 86.42it/s][*] f750ac4984453071cb5e82de093018e4d70a4f8d:  # of scenes = 0,  # of dialogs = 0
[*] f75afa70c82c3f894abccad514688a835e45c600:  # of scenes = 0,  # of dialogs = 0
[*] f7bb9eb9306b79cad4b6466f2ac3dcbd0e5fa63a:  # of scenes = 0,  # of dialogs = 0
[*] f7bf427e41af53409d7907160f7908e723b78eb0:  # of scenes = 0,  # of dialogs = 0
[*] f7f0a6294e5fe018d584fe29c7c661fc2bf1f86e:  # of scenes = 0,  # of dialogs = 0
[*] f865713a51422129cd8d15ea2bb1ac324d65afdb:  # of scenes = 0,  # of dialogs = 0
[*] f8b3d0124f396d92b58e396b6ab8e2368360c27e:  # of scenes = 0,  # of dialogs = 0
[*] f914d107471567d657d0ac815863dfd079c198db:  # of scenes = 0,  # of dialogs = 0
Segmenting:  98%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▎  | 424/433 [00:04<00:00, 89.20it/s][*] fdfbfabc1a72a0fb7f31ac7ad9e1ced05c838ef0:  # of scenes = 0,  # of dialogs = 0
[*] fe7390fde95a9ec85a35e2de5a869fcc7c7f1a34:  # of scenes = 0,  # of dialogs = 0
[*] fea54b235b1c054d1c90e87f57e3bfb64cbf3a5b:  # of scenes = 0,  # of dialogs = 0
[*] ff53fd53a94f343b8365915645b79d7ad5b1528e:  # of scenes = 0,  # of dialogs = 0
[*] ffae045d630abf7e4c282849d16819ceff60c2b0:  # of scenes = 0,  # of dialogs = 0
[*] ffcf7daee9cda766d2fcf1f6399b29be41876b21:  # of scenes = 0,  # of dialogs = 0
Segmenting: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 433/433 [00:04<00:00, 90.96it/s]
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 116/116 [00:00<00:00, 198.34it/s]
[*] # of training samples:  181831
[*] Saved in <./preprocessed/bad_format_imsdb_self_collected_add_space.pkl>
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 175/175 [00:00<00:00, 41780.69it/s]
[*] # of training samples:  0
[*] Saved in <./preprocessed/by_stats_imsdb_self_collected_add_space.pkl>
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 125/125 [00:00<00:00, 40928.02it/s]
[*] # of training samples:  0
[*] Saved in <./preprocessed/silver_imsdb_self_collected_add_space.pkl>
Traceback (most recent call last):
  File "parse.py", line 765, in <module>
    script_chunks_df = merge_data()
  File "parse.py", line 541, in merge_data
    silver_df['text'] = silver_df_with_space['text'].values     
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/frame.py", line 3163, in __setitem__
    self._set_item(key, value)
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/frame.py", line 3242, in _set_item
    value = self._sanitize_column(key, value)
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/frame.py", line 3899, in _sanitize_column
    value = sanitize_index(value, self.index)
  File "/data2/byeongjuncho/anaconda3/lib/python3.8/site-packages/pandas/core/internals/construction.py", line 751, in sanitize_index
    raise ValueError(
ValueError: Length of values (0) does not match length of index (455894)

I found problem in code

...
def merge_data():
    ### BookQA part ###
    scene_df = pickle.load(open('./preprocessed/bookQA_NER_add_space.pkl', "rb"))
    # import storedScript.csv
    movie_name_to_id_mapping = pd.read_csv('./narrative_qa/storedScript.csv')
    scene_df = scene_df.merge(movie_name_to_id_mapping, left_on="book_id", right_on="id", how="left")
    scene_df.drop('id', axis=1, inplace=True)
    scene_df = scene_df.rename(columns={'movieName': 'movie_name'})
    scene_df['source'] = 'old'

    ### silver part ###
    silver_df = pickle.load(open('./preprocessed/NER_silver_imsdb_self_collected_no_lower_preds.pkl', "rb"))
    silver_df_with_space = pd.read_pickle('./preprocessed/silver_imsdb_self_collected_add_space.pkl')  ## <=== This file is empty
# silver_df['text1'] = silver_df_with_space['text'].values
    # silver_df = silver_df.drop(columns=['text'], axis=1)
    # silver_df = silver_df.rename(columns= {'text1':'text'})
    # TODO: Is the following line equivalent to the above three?
    # TODO: no "text" in silver_df_with_space, only "sentence"
    silver_df['text'] = silver_df_with_space['text'].values       # <== So This code arise error because "silver_df_with_space" is empty dataframe
    silver_df['movie_name'] = silver_df['book_id'].str.lower().replace('-', ' ')
    silver_df['source'] = 'new'
    silver_df = silver_df.drop(['label'], axis=1)
    silver_df = silver_df.rename(columns={'label': 'predsWithTitle'})
...

Is any solution in this error?

Thank you.

MBTI labels for characters

Hi,
Sorry to bother again.
According to the paper, your work has provided a dataset including character MBTI labels, but I can't find them in the currently published version. Is this part not yet released?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.