Comments (3)
@ayush-chauhan Sorry to hear that. You can use sparklens in offline mode using the event log history files:
./bin/spark-submit --packages qubole:sparklens:0.2.0-s_2.11 \
  --class com.qubole.sparklens.app.ReporterApp qubole-dummy-arg \
  /tmp/spark-history/application_1520833877547_0285.lz4 source=history
If it is not too big, can you share the event log file? It will help in understanding the root cause.
Sorry, there was some issue with my code. I was using multithreading in my code to merge incremental code in parallel. This issue was fixed after I corrected my code.
I have one question though: why are sparklens metrics not useful in the case of multithreading?
@ayush-chauhan The way sparklens works right now is that it computes the time spent in the driver by subtracting the time spent processing jobs from the total application duration. With multithreading, jobs can overlap, so it is hard to define the notion of driver time. Also, multithreading in the driver is usually accompanied by use of the fair scheduler in Spark, and we don't have the ability to simulate the fair scheduler right now. The short answer is that it becomes a lot harder to understand the application, as well as simulate it, when we add these additional degrees of freedom.
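To illustrate why overlapping jobs make driver time ambiguous, here is a minimal sketch in Python. It is not sparklens code: the interval representation and the function names (`driver_time_naive`, `driver_time_merged`, `merged_job_time`) are hypothetical, and it only assumes the subtraction described above. With sequential jobs both definitions agree; with concurrent jobs the simple subtraction can even go negative, and any alternative is a choice rather than a given.

```python
from typing import List, Tuple

# Hypothetical representation: (start_ms, end_ms) on a shared clock.
Interval = Tuple[int, int]


def merged_job_time(jobs: List[Interval]) -> int:
    """Wall-clock time during which at least one job was running."""
    total = 0
    cur_start = cur_end = None
    for start, end in sorted(jobs):
        if cur_end is None or start > cur_end:
            # Gap before this job: close the previous merged interval.
            if cur_end is not None:
                total += cur_end - cur_start
            cur_start, cur_end = start, end
        else:
            # Overlap with the current merged interval: extend it.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start
    return total


def driver_time_naive(app: Interval, jobs: List[Interval]) -> int:
    """Application duration minus the sum of job durations."""
    return (app[1] - app[0]) - sum(end - start for start, end in jobs)


def driver_time_merged(app: Interval, jobs: List[Interval]) -> int:
    """Application duration minus time when any job was running."""
    return (app[1] - app[0]) - merged_job_time(jobs)


app = (0, 100)
sequential = [(10, 40), (50, 90)]   # single-threaded driver
concurrent = [(10, 60), (20, 90)]   # two driver threads submitting jobs

print(driver_time_naive(app, sequential), driver_time_merged(app, sequential))  # 30 30
print(driver_time_naive(app, concurrent), driver_time_merged(app, concurrent))  # -20 20
```

Even the merged figure is only one possible definition: while one thread waits on a job, another thread may be doing driver-side work, and that time is hidden inside the merged job interval.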