Document content extraction and redaction