This content has been marked as final. Show 1 reply
The Classification Build node uses a stratification split implementation to insure the build and test data have the same distribution of target values.
Your split does not include this treatment so that accounts for the difference.
We do not have a Stratified Split node for you to use at this point, but we do have a Sample node that performs something like this.
If you set the Sample Node to stratification, you can take a look at the generated sql to get a feel for the implementation.