2010 Census Is Built on Incomplete and Inaccurate Information

Last week, Census Bureau Director Robert M. Groves said to Fox News that you can “trust 2010 Census data.” What our director fails to tell us is that the two software applications have operational problems that will ultimately lead to inaccurate data. Just spend a day working in PBOCS, the Paper-Based Operational Control System which processes enumerator questionnaires from the field, or MARCS, the Matching Address Review Coding System which shows a data capture of every questionnaire that was scanned at the Baltimore Data Capture Center and you will see the poor quality of work.

census-workers

Thousands upon thousands of questionnaires are being scanned that show conflicting or incomplete data such as: vacant housing units with a population count, incorrect enumerator IDs, occupied housing units with no demographic information and the list goes on.

During the peak of the non-response follow-up (NRFU) phase of 2010 Census operations (around mid May), the Census switched to a shipping application built off a PeopleSoft/Oracle interface in order to take the load off PBOCS. Although this was a good thought in theory, the application allowed questionnaires to be shipped that were not even checked in PBOCS. In the final closeout days of the operation, PBOCS claimed many questionnaires were not checked in even though enumerators fervently claimed they turned them in.

Fortunately some of those were found in MARCS having been received at the data capture center but never scanned for shipping nor checked in. However because there was such a bottleneck sometimes it was few weeks between the time they were shipped and scanned; some questionnaires that never showed in MARCS were re-enumerated. Sometimes PBOCS would just revert some cases back to not being checked in. In a mad dash to finish and meet deadlines enumerators submitted second versions of questionnaires with little or less than accurate data replacing what may or may not have been originally submitted. Immediately after offices finished NRFU, headquarters closed the PBOCS to the local census offices to prevent further glitches.

As it has been mentioned time and time again, the Census never made it clear what constituted a completed questionnaire.

In such a recession, employees were promised more work if they finished quickly so experienced and resourceful field staff took advantage of the three visit rule sometimes making visits in consecutive days or all in a one day before going to a proxy. Local census offices managers, RCC supervisors and managers developed their own rules which were verbally communicated to field staff. These included guesstimating the population count and allowing enumerators to submit Enumerator Questionnaires (EQs) with little or no demographic information. Since performance was purely based on how many questionnaires get checked in; those who submitted hundreds of forms with nothing on the inside of the questionnaire were rewarded with more work.

On the quality assurance end, the staff attempted to examine the data collected for falsification and poor work quality. However reinterview only has been able to find those who intentionally falsified data. An enumerator can submit inaccurate or incomplete data and practically get away with it.

Most enumerators will be tempted to submit inaccurate data when they cannot gain access to the building, speak to a household member or knowledgeable proxy after repeated visits. The reinterview telephone clerks and field staff have to prove definitively by gaining access to the building or speaking to a respondent who said the interview was never conducted. But in reality the reinterview staff can never access the building, or with large apartment buildings sometimes a proxy is asked about hundreds of units and may not remember if the original interview occurred. Most of these bad data cases have little or no information or wrong information: no names, ages, Hispanic origin, race and sometimes not even a person count. But quality assurance staff have either been told to mark them refusals with an unknown population and check them in.

In the rare instance that the Census Bureaus’s quality assurance (QA) operations do suspect data falsification or inaccuracy, finding the culprit is difficult. There are thousands of questionnaires where the enumerator ID numbers are being read incorrectly at data capture. This invites data falsification in two ways. If a questionnaire is found to be inaccurate or falsified then it is impossible to find the culprit. If quality assurance staff does find an enumerator is submitting falsified or inaccurate work, they can not examine the other questionnaires the enumerator completed because many questionnaires do not have a valid enumerator associated with it.

In the current Vacant/Delete check phase of 2010 Census operations, while the agency covered up their own software problems by closing access to PBOCS, they have also created problems. For hundreds of questionnaires where enumerators clearly marked them vacant or deletes without visiting them LCOs cannot access the system to research who actually submitted this erroneous work.

Most of this is happening now in your local census offices across the country as the re-interview phase winds down. This is because of a huge backlog of EQs that were sent into re-interview, hundreds of outliers, and the slowness of MARCS. This inaccurate data is another smear of shame for the Census Bureau. For Dr. Groves to say that we can trust 2010 Census data is merely a cover-up.

COMMENTS

Please let us know if you're having issues with commenting.