exploratory-io / exploratory_func Goto Github PK
View Code? Open in Web Editor NEWR functions for Exploratory
License: Other
R functions for Exploratory
License: Other
If new data to predict doesn't have the response column used to build xgboost_binary, this error appears
model frame and formula mismatch in model.matrix()
Provide detail steps to reproduce the issue, including which data you use
What should happen
What really happened
Provide error message or screenshot if available
I want to run the do_cosine_sim.kv()
function for comparing two corpuses, but when loading the exploratory
package I run in to an error that R can't find the function.
> library(exploratory)
Package attached: exploratory v0.3.16 (same as the most recent version available through GitHub).
If you run into errors, please try restarting R.
> do_cosine_sim.kv()
Error in do_cosine_sim.kv() : could not find function "do_cosine_sim.kv"
All help much appreciated.
devtools::check() fails because of insufficient dependency information
I see this error
R CMD check results
1 error | 0 warnings | 0 notes
checking package dependencies ... ERROR
Package suggested but not available for checking: ‘gtrendsR’
Namespace dependencies not required:
‘data.table’ ‘dtw’ ‘foreach’ ‘ggplot2’ ‘iterators’ ‘scales’ ‘zoo’
See section ‘The DESCRIPTION file’ in the ‘Writing R Extensions’
manual.
Warning message:
do_market_impact_.Rd is missing name/title. Skipping
install from github also fails because of this
Provide detail steps to reproduce the issue, including which data you use
What should happen
What really happened
Provide error message or screenshot if available
countrycode function should be faster to use it in colopreth map of Exploratory desktop
Now, we use https://github.com/vincentarelbundock/countrycode directly.
You can try
Todo:
It should return result immediately, not model
I am trying to reproduce the example from the manual
> res <- data.frame("text" = c("this is what it is", "which is better")) %>%
+ do_tokenize(text) %>%
+ do_tfidf(document_id, token)
which is expected to result in:
document_id | token | count_per_doc | count_of_docs | tfidf |
---|---|---|---|---|
1 | is | 2 | 2 | 0.0000000 |
1 | it | 1 | 1 | 0.5773503 |
1 | this | 1 | 1 | 0.5773503 |
1 | what | 1 | 1 | 0.5773503 |
2 | better | 1 | 1 | 0.7071068 |
2 | is | 1 | 2 | 0.0000000 |
2 | which | 1 | 1 | 0.7071068 |
However, I obtain
document_id | token | count_per_doc | count_of_docs | tfidf |
---|---|---|---|---|
1 | is | 2 | 2 | 0.0000000 |
1 | it | 1 | 1 | 0.0000000 |
1 | this | 1 | 1 | 0.7071068 |
1 | what | 1 | 1 | 0.7071068 |
2 | better | 1 | 1 | 0.7071068 |
2 | is | 1 | 2 | 0.0000000 |
2 | which | 1 | 1 | 0.7071068 |
Another strange result is the following:
> data.frame("text" = c("good it was", "is nice she", "good is she")) %>%
+ do_tokenize(text) %>%
+ do_tfidf(document_id,token)
document_id | token | count_per_doc | count_of_docs | tfidf |
---|---|---|---|---|
1 | good | 1 | 2 | 0.327 |
1 | it | 1 | 1 | 0.327 |
1 | was | 1 | 1 | 0.887 |
2 | is | 1 | 2 | 0.327 |
2 | nice | 1 | 1 | 0.327 |
2 | she | 1 | 2 | 0.887 |
3 | good | 1 | 2 | 0.327 |
3 | is | 1 | 2 | 0.327 |
3 | she | 1 | 2 | 0.887 |
where I would expect to find identical values for "it" and "was"...
If you run do_anomaly_detection to data with non-UTC timezone, it fails to find anomaly data.
Provide detail steps to reproduce the issue, including which data you use
What should happen
What really happened
Provide error message or screenshot if available
Currently, scatter plots for coefficients of Linear regression etc. are sorted based on coefficient values of the first facet.
With that default behavior, we could provide options to...
do_cosine_sim.kv
doesn’t work under [email protected]
. Running the function with a dataset that should work fine (a bit tricky to share, but can if need be—just creating the issue to note this is a problem) generates the following:
Error: `unnest_()` is deprecated as of tidyr 1.0.0.
Please use `unnest()` instead.
to align with the change, need to use Inf
as default value for n_max
argument.
Trying to install exploratory package in R 3.6
ERROR: dependency 'anonymizer' is not available for package 'exploratory'
* removing 'C:/Users/proctors/Documents/R/win-library/3.6/exploratory'
Error: Failed to install 'exploratory' from GitHub:
Seems like it was removed?
https://cran.r-project.org/web/packages/anonymizer/index.html
Currently, list_extract supports position for extract target
And there is a requirement for support boolean logic like position >= 4
to extract a value.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.