Changed disclosure checks to include NAs for cross-tabulations. #456
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR is to fix a disclosure bug whereby a table value below the threshold can be converted to NA and then identified. The fix entails including the NAs in the disclosure test.
In the following example, there's a variable ('sex_bin') which cannot be read as at least one of the categories has a value below the filter value (3):
So, recode the variable (I know it has 3 categories: 1,2,9 - and here I suspect 9 might be the suspect category):
Then, we can cross-tabulate the two variables - and it works! Look at this:
Now we clearly see that there were some individuals such that n was below the filter value (3) but these don't get picked up by the filter trap as they are now in the NA column. Specifically, there are 2 in COHORT1, 2 in COHORT2 and 1 in COHORT3 - giving a total of 5 subjects with value 9 in the original variable.