The current test cases for attribute annotations rely on a hard-coded mapping. This ap

I remember using the <a href="https://docs.python.org/3.11/library/typing.html#typing.

I remember using the <a href="https://docs.python.org/3.11/library/typing

Another note: <a href="https://docs.python.org/3/library/functions.html#eval" rel="nof

[Feature Request]: Parse annotation mapping from the guidelines so it's read-only. about fundus HOT 7 CLOSED

flairnlp commented on July 24, 2024

[Feature Request]: Parse annotation mapping from the guidelines so it's read-only.

from fundus.

Comments (7)

dobbersc commented on July 24, 2024 1

I remember using the typing.get_type_hints function to resolve string literal type-hints to type objects. If you pass a string to this function, it interpreted it as a forward reference. Therefore, if the string literal includes a custom object, e.g. ArticleBody it has to be loaded within the global or local namespace or passed to the function as well.

from fundus.

dobbersc commented on July 24, 2024 1

No way, that would be huge! Sadly i already discarded my progress 😅

Never ever discard any progress made ;).

I've had some fun looking into the internal code of the typing.get_type_hints function. It does not exactly do what we want as stated in the documentation:

Return a dictionary containing type hints for a function, method, module or class object.

We want to do something more simple. Just resolve the string forward reference to a type object. Since the get_type_hints function internally needs to do the same at some point we can get an idea from there. It uses the _eval_type function. This function checks for forward references of the specific wrapper type ForwardRef. Now, the ForwardRef object has an internal _evaluate function. This is exactly what we need. In the end, this function uses a recursive call of the standard eval function. So for us, we could use the internal function of an object that is not meant to be instantiated by the "user" or use eval as long as it works.

Examples:

from datetime import datetime
from typing import Dict, List, Optional, ForwardRef

from src.parser.html_parser import ArticleBody

attribute_annotations: Dict[str, object] = {
    "title": Optional[str],
    "body": ArticleBody,
    "authors": List[str],
    "publishing_date": Optional[datetime],
    "topics": List[str],
}

attribute_string_annotations: Dict[str, str] = {
    "title": "Optional[str]",
    "body": "ArticleBody",
    "authors": "List[str]",
    "publishing_date": "Optional[datetime]",
    "topics": "List[str]",
}

resolved_attribute_annotations: Dict[str, object] = {
    attribute: ForwardRef(annotation)._evaluate(globals(), locals(), frozenset()) for attribute, annotation in attribute_string_annotations.items()
}
assert attribute_annotations == resolved_attribute_annotations


resolved_attribute_annotations: Dict[str, object] = {
    attribute: eval(annotation) for attribute, annotation in attribute_string_annotations.items()
}
assert attribute_annotations == resolved_attribute_annotations

from fundus.

MaxDall commented on July 24, 2024

Update:
I gave it a try using string comparisons between types parsed from the table in the guidelines and the actual annotations cast to a string, but this completely fell apart for most types from the typing module. Especially for Optional types since this itself is nothing but a TypeAlias.

Maybe someone has a better idea how to keep both things synced.

from fundus.

MaxDall commented on July 24, 2024

I remember using the typing.get_type_hints function to resolve string literal type-hints to type objects.

No way, that would be huge! Sadly i already discarded my progress 😅

from fundus.

dobbersc commented on July 24, 2024

Another note: eval() is a dangerous and unsafe function because it may execute arbitrary code. Since we only use this for our very selected use case in the testing environment and not in the code that the actual user receives, I think this is OK.

from fundus.

MaxDall commented on July 24, 2024

I ended up with eval() but ultimately abandoned it. Not because of security concerns - imo it doesn't matter, everyone can execute code through the CI as long as he opens a PR - but because I didn't want to maintain all the unused imports. That's why I switch to comparing the strings in the first place. Maybe living with the imports is the way to go.

from fundus.

dobbersc commented on July 24, 2024

That's a trade-off I guess. So far it only would be the article body. My guess for the future is also that we have way more built-in types rather than custom objects that need an import. Even with imports there is the gain not having to maintain the actual annotion guideline list. Also one would immediately be altered if an import is missing. I don't have a strong opinion, just listing some thoughts.

from fundus.

[Feature Request]: Parse annotation mapping from the guidelines so it's read-only. about fundus HOT 7 CLOSED

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent