Medium severitygateway
Power BI Refresh Error:
UserErrorParquetJavaInvocationException
What does this error mean?
An Azure Data Factory copy activity writing or reading Parquet files failed because the JVM could not start or ran out of memory on the self-hosted integration runtime.
Common causes
- 1The _JAVA_OPTIONS system environment variable on the SHIR host is missing or configured with insufficient heap space
- 2JVM heap space (Xmx) is below 8 GB, which is the minimum recommended for Parquet processing on ADF SHIR
- 3Multiple Java installations on the SHIR host are conflicting
- 4The SHIR host machine does not have the required JRE installed
How to fix it
- 1On the SHIR host machine, set the system environment variable: _JAVA_OPTIONS = -Xms256m -Xmx16g.
- 2Restart all SHIR nodes after setting the environment variable.
- 3Verify the correct JRE is installed (64-bit JRE 8 or later is required for Parquet processing on SHIR).
- 4If multiple Java versions are installed, set the JAVA_HOME environment variable to point to the correct JRE.
- 5After restarting the IR, rerun the pipeline to confirm the fix.
Frequently asked questions
Official documentation: https://learn.microsoft.com/en-us/azure/data-factory/format-parquet