Thanks !
There is a new vignette on how to do reprex with dbplyr, specifically to deal with translation issue. See : https://dbplyr.tidyverse.org/articles/reprex.html
Here is a minimal reprex for the change of translation
library(dplyr)
library(dbplyr)
tab <- tbl_lazy(iris, con = simulate_hive())
query <- tab %>%
group_by(Species) %>%
filter(row_number() == 1L)
query %>% show_query()
#> <SQL>
#> SELECT *
#> FROM `df`
#> WHERE (row_number() = 1)
Created on 2019-05-01 by the reprex package (v0.2.1.9000)
I think you can open an issue in the dblyr repo. I use git bisect to find where this comes from and it is this commit that has changed the behavior
Before, base odbc translation where used (base_odbc_agg and base_odbc_win) and it worked with hive. Now, the translation are customized and row_number() is missing, as window function.
You should open the issue explaining all this, and if you know the correct SQL it could be awesome. I don't have a hive database to test and try so it is better if it is you. I can help with a PR if you want - it is rather easy to add some new translation. Hive translation live there
If you prefer that I open the issue, please do not hesitate.
Note for those who don't know git bisect (yet):
it is a very useful tool in git to find which is the first commit to introduce the bug or regression. With this tool, and a reprex it is very efficient!
Recently Jim Hester has done a video on this topic - a great way to discover: video