Hello all! I felt the documentation was limited for getting Dremio Server connected to R using ODBC drivers for macOS, so I figured out how to get it working and created an R Markdown walkthrough for anyone else who is interested!
Thank you for reaching out. Yes, I do believe both the tutorial you linked
and the Dremio documentation section for R that shares its code are
incomplete. Here are my concerns/feedback:
In the code you use to connect to Dremio, you use the “RODBC” package.
From what I understand, the “odbc” package has greater support from RStudio
and is a superior method of connecting to databases. For instance, in the
benchmarking section in the odbc package documentation, it says: "The odbc package
is often much faster than the existing RODBC and DBI compatible
RODBCDBI packages."
Moreover, RStudio gives a tutorial using odbc as an example of how to
connect to a database; see https://rviews.rstudio.com/2017/05/17/databases-using-r/.
Your tutorial also shows the connection using the "Dremio Connector"
driver that is only available for Windows. R is cross-platform and, thus,
if your tutorial only mentions the Dremio Connector software, it leaves
Linux and Mac R users out in the cold. This is another reason I went the
odbc route. Having the ODBC driver installed alongside unixODBC
on my Mac allowed me to connect to Dremio using the odbc package
in R.
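For anyone who wants to see roughly what the odbc route looks like, here is a minimal sketch. It assumes the driver is registered with unixODBC under the name "Dremio ODBC Driver" (a placeholder; use whatever name appears in your odbcinst.ini), and the connection-string keys and default port 31010 are my assumptions about the Dremio driver rather than something taken from the tutorial. The helper names `dremio_conn_string` and `connect_dremio` are made up for illustration.

```r
# Build an ODBC connection string from named parts.
# The key names (ConnectionType, AuthenticationType, ...) and the default
# port are assumptions; check them against your driver's documentation.
dremio_conn_string <- function(driver, host, port = 31010, uid, pwd) {
  parts <- c(
    Driver = driver,
    ConnectionType = "Direct",
    HOST = host,
    PORT = as.character(port),
    AuthenticationType = "Plain",
    UID = uid,
    PWD = pwd
  )
  # Join as key=value pairs separated by semicolons.
  paste0(names(parts), "=", parts, collapse = ";")
}

# Open a DBI connection through the odbc package.
# Requires the DBI and odbc packages plus a working unixODBC setup.
connect_dremio <- function(conn_string) {
  DBI::dbConnect(odbc::odbc(), .connection_string = conn_string)
}
```

Usage would be something like `con <- connect_dremio(dremio_conn_string("Dremio ODBC Driver", "localhost", uid = "user", pwd = "secret"))`, followed by `DBI::dbGetQuery(con, "SELECT 1")` as a quick check that the connection works.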
These two links are dead in your documentation for R: Install the Dremio
Connector (ODBC) or Dremio JDBC Driver.
This is really great, thanks cojamalo! Given dplyr’s usefulness in keeping data manipulation on the server side, this is very useful for dealing with large datasets. Another useful tip, shown below using RJDBC but equally doable with odbc, is setting up VDSs programmatically, which is useful for rebuilding a Dremio service from a code repo:
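As a rough idea of how the odbc variant might look, here is a minimal sketch using DBI. It assumes an open DBI connection `con`, that the server accepts a `CREATE OR REPLACE VDS ... AS ...` statement, and that the driver passes DDL through `dbExecute`; the function names and the example space/dataset names are hypothetical.

```r
# Build the SQL for one virtual dataset (VDS).
# `path` is a character vector such as c("space", "dataset"); each part is
# double-quoted and the parts are joined with dots.
vds_statement <- function(path, select_sql) {
  quoted <- paste0('"', path, '"', collapse = ".")
  paste0("CREATE OR REPLACE VDS ", quoted, " AS ", select_sql)
}

# Run every definition against an open DBI connection.
# `definitions` is a list of lists, each with `path` and `sql` elements,
# e.g. loaded from files kept in a code repo.
create_vds <- function(con, definitions) {
  for (def in definitions) {
    DBI::dbExecute(con, vds_statement(def$path, def$sql))
  }
  invisible(length(definitions))
}
```

For example, `vds_statement(c("analytics", "trips_clean"), "SELECT * FROM src.trips")` produces the statement for a dataset `trips_clean` in a space called `analytics`, and `create_vds(con, defs)` would replay a whole list of such definitions.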
Thanks for putting up a detailed guide. However, I am stuck at the point in your guide where we need to modify the “.ini” files in the usr/local directory.