Bivariate and Multivariate Techniques
Provide a substantive contribution that advances the discussion in a meaningful way by identifying strengths of the posting, challenging assumptions, and asking clarifying questions. Your response is expected to reference the assigned readings, as well as other theoretical, empirical, or professional literature to support your views and writings. Reference your sources using standard APA guidelines.
Peers have responded to:
Data Screening
Save your time - order a paper!
Get your paper written from scratch within the tight deadline. Our service is a reliable solution to all your troubles. Place an order on any task and we will take care of it. You won’t have to worry about the quality and deadlines
Order Paper NowFor this discussion, identify the goals of data screening. Then discuss how you can identify and remedy the following:
Errors in data entry.
Outliers.
Missing data.
PEER RESPONSE 1
Cait Bahr
Warner (2013) discusses the many goals of data screening in Chapter 4 of Applied Statistics from Bivariate Through Multivariate Techniques. Most importantly, data screening is used to monitor problems that may occur in data analyses (Warner, 2013). In data screening, it is necessary to identify all errors in data collection, otherwise, the results and data will be skewed and therefore incorrect. Other goals of data screening include correcting data errors, determining what data is missing and finding extreme outliers (Warner, 2013). Data sampling needs to be addressed in most data entry because there are significant problems that can occur when processing data (Warner, 2013). Some notable problems with data include extreme outliers, missing data, errors in data entry, small sample sizes, and nonlinear relationships in the data (Warner, 2013).
When identifying and remedying issues in data entry it is incredibly important to fix errors in data entry, locate outliers and correct any missing data (Warner, 2013). In order to address errors in data entry, all of the data must first be completely proofread (Warner, 2013). To ensure that the correct data is being entered, it is wise to compare the data being entered to the original data from the experiment (Warner, 2013). Additionally, the numbers being entered should be also compared to logbooks in addition to being checked by multiple individuals (Warner, 2013). In data entry, outliers are normally numbers that are either too high or look unusual (out of place) (Warner, 2013). So outliers do not skew or create false data, many researchers remove outliers prior to data entry (Warner, 2013). Lastly, when there is missing data there are many ways to replace data (Warner, 2013). For one, SPSS allows researchers to find missing data in their application, system missing values (Warner, 2013). Rather than counting the missing data as a form of data, this software instead replaces the missing data with an estimated value or excludes the missing data completely (Warner, 2013). Regardless of how the missing data is corrected, researchers are suggested to report all missing data in their research findings summary (Warner, 2013).
Reference
Warner, R. M. (2013). Applied Statistics From Bivariate Through Multivariate Techniques (2nd ed.). Thousand Oakes, CA: SAGE Publications.
PEER RESPONSE 2
Teddrick McCreary
What are the goals of data screening? When we are working with numbers and data, there can be many different errors, measurements, inconsistencies, missing values or outliers. It is important that when we are researching information regarding any population, we need to run preliminary analyses with the data collection (Warner, 2013). For myself, I found it easier to understand by means of: data screening is the way to inspect the data collected for any errors and correct those errors before the final analysis. This is done by checking over the data, recognizing any outliers, and then dealing with any missing data.
How can I identify and fix errors in data entry, outliers, and missing data?
• Errors in data entry: That really can be avoided by knowing more about where you are getting your information, any experiments, and being aware that there may be errors (Warner, 2013). For myself, with the fact that I collect much of my data through anecdotals. So I am in the classroom, observing and collecting data on frequencies of behaviors, new behaviors, or types of behaviors. One of the main factors of error in my data entry would be the issue of reactivity. Meaning, since I am there and they are aware of my presence, they may be acting different because I am there. This does happen because they can be very happy that I am there and act differently because of that, or on the other hand I could be making them more anxious then they are normally. Either way, this is going to give me different numbers than if I were just to leave it to the staff to collect and document the data I need.
• Outliers: An outlier is data that is extremely different than what others are in the same sample, population, or individuals being surveyed (Warner, 2013). An outlier can be significant due to alerting the researcher that there is an abnormality or an error in the measurements that one is working on. When we find that we have an outlier, we then need to decide if we want to keep or take out the outlier. There are sometimes in which it would be ok to not add the outlier; however, there are times in which this outlier could be showing us a new trend or discovery in the information that we are looking at.
• Missing Data: Typically, we are not going to find the missing data until we enter the information into SPSS. Once we have entered the data, then there are two types of missing data, system missing and human error (Warner, 2013). We can do somethings with missing data. One thing we can do is just leave it alone. The missing values could be small, non-random, or something that we could just create a composite of the item by averaging them together into a new variable. One thing that we have to keep in mind is that with this missing data, SPSS with either use a listwise or pairwise deletion. Another thing we could do is delete the cases with the missing values, this way we have complete data for all of the subjects in the study (Psychwiki, 2011).
References
Psychwiki. (2011). Dealing with Missing Data. Retrieved from Psychwiki.com: http://www.psychwiki.com/wiki/Dealing_with_Missing_Data
Warner, R. M. (2013). Applied Statistics: From Bivariate Through Multivariate Techniques (2nd ed.). Thousand Oaks, CA: Sage.
My Initial Response:
Data Screening
The goals of data screening include:
To check for outliers
To check for missing data
To check for errors in data entry
Outliers can be spotted by either plotting a box/plot or a histogram of the variable. They can be corrected by omitting them in the analysis.
Missing data can be identified by exploring through your data set before commencing analysis. You can opt to delete the missing cases all together or fill in a value like the mean if deletion will result in sample size quality.
Errors in data entry can be spotted by either exploring your data set before analysis, counter checking with the source data or plotting a histogram and checking for inconsistent values. This error can be avoided by one been keen using data entry or using digital data capture media.
References
Warner, R. M. (2013). Applied Statistics From Bivariate Through Multivariate Techniques (2nd ed.). Thousand Oakes, CA: SAGE Publications.
ADDITIONAL INSTRUCTIONS FOR THE CLASS
Discussion Questions (DQ)
Initial responses to the DQ should address all components of the questions asked, include a minimum of one scholarly source, and be at least 250 words.
Successful responses are substantive (i.e., add something new to the discussion, engage others in the discussion, well-developed idea) and include at least one scholarly source.
One or two sentence responses, simple statements of agreement or “good post,” and responses that are off-topic will not count as substantive. Substantive responses should be at least 150 words.
I encourage you to incorporate the readings from the week (as applicable) into your responses.
Weekly Participation
Your initial responses to the mandatory DQ do not count toward participation and are graded separately.
In addition to the DQ responses, you must post at least one reply to peers (or me) on three separate days, for a total of three replies.
Participation posts do not require a scholarly source/citation (unless you cite someone else’s work).
Part of your weekly participation includes viewing the weekly announcement and attesting to watching it in the comments. These announcements are made to ensure you understand everything that is due during the week.
APA Format and Writing Quality
Familiarize yourself with APA format and practice using it correctly. It is used for most writing assignments for your degree. Visit the Writing Center in the Student Success Center, under the Resources tab in LoudCloud for APA paper templates, citation examples, tips, etc. Points will be deducted for poor use of APA format or absence of APA format (if required).
Cite all sources of information! When in doubt, cite the source. Paraphrasing also requires a citation.
I highly recommend using the APA Publication Manual, 6th edition.
Use of Direct Quotes
I discourage overutilization of direct quotes in DQs and assignments at the Masters’ level and deduct points accordingly.
As Masters’ level students, it is important that you be able to critically analyze and interpret information from journal articles and other resources. Simply restating someone else’s words does not demonstrate an understanding of the content or critical analysis of the content.
It is best to paraphrase content and cite your source.
LopesWrite Policy
For assignments that need to be submitted to LopesWrite, please be sure you have received your report and Similarity Index (SI) percentage BEFORE you do a “final submit” to me.
Once you have received your report, please review it. This report will show you grammatical, punctuation, and spelling errors that can easily be fixed. Take the extra few minutes to review instead of getting counted off for these mistakes.
Review your similarities. Did you forget to cite something? Did you not paraphrase well enough? Is your paper made up of someone else’s thoughts more than your own?
Visit the Writing Center in the Student Success Center, under the Resources tab in LoudCloud for tips on improving your paper and SI score.
Late Policy
The university’s policy on late assignments is 10% penalty PER DAY LATE. This also applies to late DQ replies.
Please communicate with me if you anticipate having to submit an assignment late. I am happy to be flexible, with advance notice. We may be able to work out an extension based on extenuating circumstances.
If you do not communicate with me before submitting an assignment late, the GCU late policy will be in effect.
I do not accept assignments that are two or more weeks late unless we have worked out an extension.
As per policy, no assignments are accepted after the last day of class. Any assignment submitted after midnight on the last day of class will not be accepted for grading.
Communication
Communication is so very important. There are multiple ways to communicate with me:
Questions to Instructor Forum: This is a great place to ask course content or assignment questions. If you have a question, there is a good chance one of your peers does as well. This is a public forum for the class.
Individual Forum: This is a private forum to ask me questions or send me messages. This will be checked at least once every 24 hours.