{smcl} {* 14jun2018}{...} {hi:help pshare} {hline} {title:Title} {pstd}{hi:pshare} {hline 2} Compute and graph percentile shares {title:Syntax} {pstd} Percentile share estimation: {p 8 15 2} {cmd:pshare} [{cmdab:e:stimate}] {varlist} {ifin} {weight} [{cmd:,} {help pshare##estopt:{it:estimate_options}} ] {pstd} Computing contrasts between outcome variables or subgroups: {p 8 15 2} {cmd:pshare} {cmdab:c:ontrast} [{help pshare##contrast:{it:base}}] [{cmd:,} {help pshare##contopt:{it:contrast_options}} ] {p 8 8 2} where {it:base} is the name of the outcome variable or the value of the subpopulation to be used as base. {it:base} may also be {cmd:#1}, {cmd:#2}, or {cmd:#3}, etc. to refer to the 1st, 2nd, or 3rd, etc. outcome variable or subpopulation. See the {helpb pshare##contrast:contrast()} option of {cmd:pshare estimate} for more details {pstd} Drawing a stacked bar chart of the results: {p 8 15 2} {cmd:pshare} {cmdab:s:tack} [{cmd:,} {help pshare##stackopt:{it:stack_options}} ] {pstd} Drawing a histogram of the results: {p 8 15 2} {cmd:pshare} {cmdab:h:istogram} [{cmd:,} {help pshare##histopt:{it:histogram_options}} ] {synoptset 22 tabbed}{...} {marker estopt}{col 5}{help pshare##estoptions:{it:estimate_options}}{col 29}Description {synoptline} {syntab :Main} {synopt :{opt pr:oportion}}report shares as proportions; the default {p_end} {synopt :{opt percent}}report shares as percentages {p_end} {synopt :{opt den:sity}}report shares as densities {p_end} {synopt :{opt sum}}report shares as sums (outcome totals) {p_end} {synopt :{opt ave:rage}}report shares as averages {p_end} {synopt :{opt general:ized}}report generalized shares {p_end} {synopt :{cmdab:norm:alize(}{help pshare##normalize:{it:spec}}{cmd:)}}normalize results with respect to the specified total {p_end} {synopt :{opt gini}}also report Gini coefficient(s) {p_end} {syntab :Percentiles} {synopt :{opt n:quantiles(#)}}use # percentile groups of equal size; default is {cmd:nquantiles(5)} (quintiles) {p_end} {synopt :{opth p:ercentiles(numlist)}}use percentile groups corresponding to the specified cumulative percentages {p_end} {synopt :{cmd:pvar(}{help varname:{it:pvar}}{cmd:)}}base percentile groups on {it:pvar} instead of the outcome variable {p_end} {synopt :{opt step}}determine lorenz ordinates from step function; the default is to employ linear interpolation {p_end} {syntab :Over} {synopt :{opth over(varname)}}compute results for subpopulations defined by the values of {it:varname} {p_end} {synopt :{opt t:otal}}include overall results across all subpopulations; only allowed with {cmd:over()} {p_end} {syntab :Contrast/Graph} {synopt :{cmdab:c:ontrast}[{cmd:(}{help pshare##contrast:{it:spec}}{cmd:)}]}compute differences in percentile shares between outcome variables or subpopulations {p_end} {synopt :{cmdab:s:tack}[{cmd:(}{help pshare##stackopt:{it:options}}{cmd:)}]}draw a stacked bar chart of the results; {it:options} are {help pshare##stackopt:{it:stack_options}} as described below {p_end} {synopt :{cmdab:h:istogram}[{cmd:(}{help pshare##histopt:{it:options}}{cmd:)}]}draw a histogram of the results; {it:options} are {help pshare##histopt:{it:histogram_options}} as described below {p_end} {syntab :SE/SVY} {synopt :{cmd:vce(}{help pshare##vcetype:{it:vcetype}}{cmd:)}}{it:vcetype} may be {cmd:analytic} (the default), {cmdab:cl:uster} {it:clustvar}, {cmdab:boot:strap} or {cmdab:jack:knife} {p_end} {synopt :{opt cl:uster(clustvar)}}synonym for {cmd:vce(cluster} {it:clustvar}{cmd:)} {p_end} {synopt :{cmd:svy}[{cmd:(}{help pshare##svy:{it:subpop}}{cmd:)}]}take account of survey design as set by {helpb svyset}, optionally restricting computations to {it:subpop} {p_end} {synopt :{opt nose}}supress computation of standard errors and confidence intervals {p_end} {syntab :Reporting} {synopt :{opt l:evel(#)}}set confidence level; default is {cmd:level(95)} {p_end} {synopt :{opt nohe:ader}}suppress output header {p_end} {synopt :{opt notab:le}}suppress output table {p_end} {synopt :{opt nogtab:le}}suppress table of Gini coefficients {p_end} {synopt :{help pshare##displayopts:{it:display_options}}}standard reporting options as described in {helpb estimation options:[R] estimation options} {p_end} {synoptline} {p 4 6 2} {opt pweight}s, {opt iweight}s, and {opt fweight}s are allowed; see {help weight}. {synoptset 22 tabbed}{...} {marker contopt}{col 5}{help pshare##contoptions:{it:contrast_options}}{col 29}Description {synoptline} {synopt :{opt r:atio}}compute ratios instead of differences {p_end} {synopt :{opt lnr:atio}}compute logarithms of ratios instead of differences {p_end} {synopt :{cmdab:s:tack}[{cmd:(}{help pshare##stackopt:{it:options}}{cmd:)}]}draw a stacked bar chart of the results; {it:options} are {help pshare##stackopt:{it:stack_options}} as described below {p_end} {synopt :{cmdab:h:istogram}[{cmd:(}{help pshare##histopt:{it:options}}{cmd:)}]}draw a histogram of the results; {it:options} are {help pshare##histopt:{it:histogram_options}} as described below {p_end} {synopt :{help pshare##displayopts:{it:display_options}}}standard reporting options as described in {helpb estimation options:[R] estimation options} {p_end} {synoptline} {synoptset 22 tabbed}{...} {marker stackopt}{col 5}{help pshare##stackoptions:{it:stack_options}}{col 29}Description {synoptline} {syntab :Main} {synopt :{opt vert:ical}}vertical bar plot {p_end} {synopt :{opt hor:izontal}}horizontal bar plot; the default {p_end} {synopt :{opt prop:ortion}}population axis displays proportion, not percent {p_end} {synopt :{opt rev:erse}}order percentile groups from top to bottom, not from bottom to top {p_end} {synopt :{cmd:keep(}{help pshare##stackkeep:{it:list}}{cmd:)}}select and order outcome variables or subpopulations to be included in the graph {p_end} {synopt :{cmdab:s:ort}[{cmd:(}{help pshare##sortopts:{it:options}}{cmd:)}]}order outcome variables or subpopulation by size of top share or as specified by {it:options} {p_end} {synopt :{cmdab:g:ini(}{help format:{it:%fmt}}{cmd:)}}specify format for Gini coefficients; default format is {cmd:%9.3g} {p_end} {synopt :{opt nog:ini}}omit Gini coefficients {p_end} {syntab :Labels/Rendering} {synopt :{cmdab:lab:els(}{help pshare##stacklabels:{it:labels}}{cmd:)}}specify custom axis labels for outcome variables/subpopulations {p_end} {synopt :{cmdab:plab:els(}{help pshare##plabels:{it:labels}}{cmd:)}}specify custom legend labels for percentile groups {p_end} {synopt :{opt barw:idth(#)}}set width of bars; default is {cmd:barwidth(0.75)} {p_end} {synopt :{it:{help barlook_options}}}affect rendition of the plotted bars {p_end} {synopt :{opth p#(barlook_options)}}affect rendition of #th segment of the stacked bars {p_end} {synopt :{cmdab:v:alues}[{cmd:(}{help format:{it:%fmt}}{cmd:)}]}include values of percentile shares as marker labels {p_end} {synopt :{it:{help marker_label_options}}}affect rendition of the values included as marker labels {p_end} {syntab :Add plots} {synopt :{opth "addplot(addplot_option:plot)"}}add other plots to the graph {p_end} {syntab :Y-Axis, X-Axis, Title, Caption, Legend, Overall} {synopt :{it:{help twoway_options}}}any options other than {cmd:by()} documented in {helpb twoway_options:[G] {it:twoway_options}} {p_end} {synoptline} {synoptset 22 tabbed}{...} {marker histopt}{col 5}{help pshare##histoptions:{it:histogram_options}}{col 29}Description {synoptline} {syntab :Main} {synopt :{opt vert:ical}}vertical bar plot; the default {p_end} {synopt :{opt hor:izontal}}horizontal bar plot {p_end} {synopt :{opt prop:ortion}}population axis in proportion, not percent {p_end} {synopt :{cmd:keep(}{help pshare##keep:{it:list}}{cmd:)}}select and order results to be included as subgraphs {p_end} {synopt :{cmd:max(}{it:#}[, {help pshare##maxmin:{it:options}}]{cmd:)}}truncate bars from above {p_end} {synopt :{cmd:min(}{it:#}[, {help pshare##maxmin:{it:options}}]{cmd:)}}truncate bars from below {p_end} {synopt :{cmd:prange(}{it:min} {it:max}{cmd:)}}restrict range of percentile groups to be included in the graph {p_end} {synopt :{cmdab:g:ini(}{help format:{it:%fmt}}{cmd:)}}specify format for Gini coefficients; default format is {cmd:%9.3g} {p_end} {synopt :{opt nog:ini}}omit Gini coefficients from subgraph labels {p_end} {syntab :Labels/Rendering} {synopt :{it:{help barlook_options}}}affect rendition of the plotted bars {p_end} {synopt :{opt step}}draw results as step function instead of bars {p_end} {synopt :{cmdab:adds:tep}[{cmd:(}{it:{help line_options:line_opts}}{cmd:)}]}draw results as step function in addition to bars {p_end} {synopt :{cmdab:spike:s}[{cmd:(}{it:#}{cmd:)}]}draw a series of spikes instead of bars; implies {cmd:noci} {p_end} {synopt :{cmdab:lab:els(}{help pshare##labels:{it:labels}}{cmd:)}}specify custom labels for subgraphs {p_end} {synopt :{cmdab:byopt:s(}{help by_option:{it:byopts}}{cmd:)}}specify how subgraphs are combined {p_end} {synopt :{opt over:lay}}combine results in single plot instead of using subgraphs; implies {cmd:noci} {p_end} {synopt :{cmd:o#(}{help pshare##oopts:{it:options}}{cmd:)}}affect rendition of #th plot; for use with {cmd:overlay} {p_end} {synopt :{cmd:psep}[{cmd:(}{help pshare##psep:{it:labels}}{cmd:)}]}use different styling for each percentile group {p_end} {synopt :{cmd:p#(}{help pshare##popts:{it:options}}{cmd:)}}affect rendition of #th plot; for use with {cmd:psep()} {p_end} {syntab :Confidence intervals} {synopt :{opt l:evel(#)}}set confidence level; not allowed if {cmd:ci()} is {cmd:bc}, {cmd:bca}, or {cmd:percentile} {p_end} {synopt :{cmd:ci(}{help pshare##citype:{it:citype}}{cmd:)}}choose type of bootstrap CI; {it:citype} may be {cmdab:nor:mal} (the default), {cmd:bc}, {cmd:bca}, or {cmdab:p:ercentile} {p_end} {synopt :{cmdab:ciopt:s(}{help pshare##ciopts:{it:options}}{cmd:)}}affect rendition of the plotted confidence spikes; see help {helpb graph twoway rcap} {p_end} {synopt :{opt cib:elow}}place the confidence interval spikes behind the plotted bars {p_end} {synopt :{opt noci}}omit confidence intervals {p_end} {syntab :Add plots} {synopt :{opth "addplot(addplot_option:plot)"}}add other plots to the graph {p_end} {syntab :Y-Axis, X-Axis, Title, Caption, Legend, Overall} {synopt :{it:{help twoway_options}}}any options other than {cmd:by()} documented in {helpb twoway_options:[G] {it:twoway_options}} {p_end} {synoptline} {title:Description} {pstd} {cmd:pshare estimate} computes percentile shares or, more generally, quantile shares for one or several outcome variables or subpopulations from individual level data (grouped data is not supported). Percentile shares are often used in inequality research to study the distribution of income or wealth. They are defined as differences between Lorenz ordinates of the outcome variable. Technically, the observations are sorted in increasing order of the outcome variable and the specified percentiles (quintiles by default) are computed from the running sum of the outcomes. Percentile shares are then computed as differences between percentiles, divided by total outcome (for methodological details see {help pshare##jann2016:Jann 2016}). {pstd} Given the results form {cmd:pshare estimate} for several outcome variables or subpopulations, {cmd:pshare contrast} computes differences in percentile shares between outcome variables or subpopulations. {pstd} {cmd:pshare stack} draws a stacked bar chart of the results from {cmd:pshare estimate} or {cmd:pshare contrast}. One stacked bar is drawn for each outcome variable or subpopulation. {pstd} {cmd:pshare histogram} plots the results from {cmd:pshare estimate} or {cmd:pshare contrast} as a histogram. In case of multiple outcome variables or multiple subpopulations, several subgraphs are drawn. Confidence intervals are included as capped spikes. {pstd} {cmd:pshare} without arguments replays the previous results. Reporting options may be applied. {marker estoptions}{...} {title:Options for pshare estimate} {dlgtab:Main} {phang} {cmd:proportion}, {cmd:percent}, {cmd:density}, {cmd:sum}, {cmd:average}, and {cmd:generalized} select the type of results to be computed. The default is {cmd:proportion}, that is, to report percentile shares as proportions. Use option {cmd:percent} to report percentile shares as percentages. Furthermore, use option {cmd:density} to report densities, defined as outcome shares divided by population shares (so that in a bar chart the areas of the bars are proportional to the outcome shares). Outcome sums (totals) and average outcomes can be requested by options {cmd:sum} and {cmd:average}, respectively. Finally, use option {cmd:generalized} to report generalized percentile shares, defined as differences between generalized Lorenz ordinates. Only one of {cmd:proportion}, {cmd:percent}, {cmd:density}, {cmd:sum}, {cmd:average}, or {cmd:generalized} is allowed. {marker normalize}{...} {phang} {cmd:normalize(}{it:spec}{cmd:)} causes results to be normalized with respect to the specified total (not allowed in combination with {cmd:sum}, {cmd:average}, or {cmd:generalized}). {it:spec} is [{it:over}{cmd::}][{it:total}] {pmore} where {it:over} may be {cmd:.} the subpopulation at hand (the default) {it:#} the subpopulation identified by value {it:#} {cmd:#}{it:#} the {it:#}th subpopulation {cmdab:t:otal} the total across all subpopulations {pmore} and {it:total} may be {cmd:.} the total of the variable at hand (the default) {cmd:*} the total of the sum across all analyzed outcome variables {varlist} the total of the sum across the variables in {varlist} {it:#} a total equal to {it:#} {pmore} {it:total} specifies the variable(s) from which the total is to be computed, or sets the total to a fixed value. If multiple variables are specified, the total across all specified variables is used ({varlist} may contain external variables that are not among the list of analyzed outcome variables). {it:over} selects the reference population from which the total is to be computed; {it:over} is only allowed if the {cmd:over()} option has been specified (see below). Subpopulation sizes (sum of weights) are taken into account for the computation of densities (option {cmd:density}) if {it:over} is provided, so that the densities reflect multiples of the average outcome in the reference population. {phang} {cmd:gini} causes Gini coefficients (a.k.a. concentration indices if {cmd:pvar()} is specified) to be computed and reported in a separate table. Variance estimation for Gini coefficients is not supported. {dlgtab:Percentiles} {phang} {opt nquantiles(#)} specifies the number of (equally sized) percentile groups to be used. The default is to use quintiles, that is, {cmd:nquantiles(5)}. This is equivalent to typing {cmd:percentiles(20 40 60 80)}. {phang} {opth percentiles(numlist)} specifies, as percentages, the percentiles to be used as threshold for the percentile groups. For example, for deciles type {cmd:percentiles(10 20 30 40 50 60 70 80 90)}, or, as a shorthand, {cmd:percentiles(10(10)90)}. To compute shares of the bottom 50%, 50-90%, 90-95%, 95-99%, 99-99.9%, and the top 0.1%, for example, you could type {cmd:percentiles(50 90 95 99 99.9)}. {phang} {cmd:pvar(}{help varname:{it:pvar}}{cmd:)} causes the percentile groups to be based on variable {it:pvar} instead of the outcome variable. That is, observations will be sorted in increasing order of {it:pvar} and percentiles will be determined from the running sum of the outcome variable across this sort order (using averaged values within ties of {it:pvar}). Use this option to analyze relations between different variables (e.g. how wealth is distributed across different income groups). If {opt pvar()} is specified, the computed percentile shares correspond to differences between ordinates of the "concentration curve" of the outcome variable with respect to {it:pvar}. {phang} {opt step} causes the Lorenz ordinates to be determined from the step function of cumulative outcomes. The default is to employ linear interpolation in regions where the step function is flat. {dlgtab:Over} {phang} {opth over(varname)} reports results for each subpopulation defined by the values of {it:varname}. Only one outcome variable is allowed if {cmd:over()} is specified. {phang} {opt total} causes additional overall results across all subpopulations to be reported. {cmd:total} is only allowed if {cmd:over()} is specified. {dlgtab:Contrast/Graph} {marker contrast}{...} {phang} {cmd:contrast}[{cmd:(}{it:spec}{cmd:)}] causes differences in percentile shares to be computed between outcome variables or between subpopulations, where {it:spec} is [{it:base}] [, {cmdab:r:atio} {cmdab:lnr:atio} ] {pmore} To report contrasts as ratios instead of differences, specify the {cmd:ratio} suboption; to report contrast as logarithms of ratios, specify the {cmd:lnratio} suboption. {pmore} If {cmd:over()} is specified together with {cmd:total}, the default is to use the overall total across subpopulations as base for the contrasts. In all other cases, the default is to compute adjacent contrasts (i.e. using the preceding outcome variable or subpopulation as base). Alternatively, specify {it:base} to select the base for the contrasts. {pmore} In case of multiple outcome variables, {it:base} is the name of the outcome variable to be used as base. For example, {com}. pshare estimate y1990 y2000 y2010, contrast(y1990){txt} {pmore} computes differences in percentile shares with respect to {cmd:y1990}. Likewise, if {cmd:over()} is specified, {it:base} is the value of the subpopulation to be used as base. For example, {com}. pshare estimate wage, over(race) contrast(1){txt} {pmore} computes differences with respect to {cmd:race}==1. Alternatively, {it:base} may also be {cmd:#1}, {cmd:#2}, {cmd:#3}, etc. to use the 1st, 2nd, 3rd, etc. outcome variable or subpopulation as the base for the contrasts. For example, {com}. pshare estimate wage, over(race) contrast(#2){txt} {pmore} uses the second subpopulation as base for the contrasts. {phang} {cmd:stack}[{cmd:(}{help pshare##stackoptions:{it:options}}{cmd:)}] draws a stacked bar chart of the results. {it:options} are as described for {helpb pshare##stackoptions:pshare stack} below. {phang} {cmd:histogram}[{cmd:(}{help pshare##histoptions:{it:options}}{cmd:)}] draws a histogram of the results. {it:options} are as described for {helpb pshare##histoptions:pshare histogram} below. {dlgtab:SE/SVY} {marker vcetype}{...} {phang} {opth vce(vcetype)} determines how standard errors and confidence intervals are computed. {it:vcetype} may be: {cmd:analytic} {cmd:cluster} {it:clustvar} {cmd:bootstrap} [{cmd:,} {help bootstrap:{it:bootstrap_options}}] {cmd:jackknife} [{cmd:,} {help jackknife:{it:jackknife_options}}] {pmore} The default is {cmd:vce(analytic)}, using approximate formulas for variance estimation assuming independent data. For clustered data, specify {cmd:vce(cluster} {it:clustvar}{cmd:)}, where {it:clustvar} is the variable identifying the clusters. Methods and formulas are based on Binder and Kovacevic (1995; also see Kovacevic and Binder 1997). For bootstrap and jackknife estimation, see help {it:{help vce_option}}. Variance estimation is not supported if {cmd:iweights} or {cmd:fweights} are specified. {phang} {opt cluster(clustvar)} is a synonym for {cmd:vce(cluster} {it:clustvar}{cmd:)}. {marker svy}{...} {phang} {cmd:svy}[{cmd:(}{it:subpop}{cmd:)}] causes the survey design to be taken into account for variance estimation. Methods and formulas are based on Binder and Kovacevic (1995). The data need to be set up for survey estimation; see help {helpb svyset}. Specify {it:subpop} to restrict survey estimation to a subpopulation, where {it:subpop} is [{varname}] [{it:{help if}}] {pmore} The subpopulation is defined by observations for which {it:varname}!=0 and for which the {cmd:if} condition is met. See help {helpb svy} and {manlink SVY subpopulation estimation} for more information on subpopulation estimation. {pmore} The {cmd:svy} option of {cmd:pshare} only works if the variance estimation method is set to Taylor linearization by {helpb svyset} (the default). For other variance estimation methods you may use the usual {helpb svy} prefix command. For example, you could type {cmd:svy brr: pshare ...} to use BRR variance estimation. {cmd:pshare} does not allow the {helpb svy} prefix for Taylor linearization due to technical reasons. This is why the {cmd:svy} option is provided. {phang} {opt nose} suppresses the computation of standard errors and confidence intervals. Use the {cmd:nose} option to speed-up computations when analyzing population data. The {cmd:nose} option may also be useful to speed-up computations with prefix commands that use replication techniques for variance estimation, such as, e.g., {helpb svy jackknife}. Options {cmd:vce(bootstrap)} and {cmd:vce(jackknife)} imply {cmd:nose}. {dlgtab:Reporting} {phang} {opt level(#)} specifies the confidence level, as a percentage, for confidence intervals. The default is {cmd:level(95)} or as set by {helpb set level}. {phang} {opt noheader} suppresses the output header; only the coefficient table is displayed. {phang} {opt notable} suppresses the coefficient table. {phang} {opt nogtable} suppresses the table containing Gini coefficients. {marker displayopts}{...} {phang} {it:display_options} are standard reporting options such as {cmd:cformat()}, {cmd:pformat()}, {cmd:sformat()}, or {cmd:coeflegend}. See {helpb estimation options:[R] estimation options}. {marker contoptions} {title:Options for pshare contrast} {phang} {cmd:ratio} causes contrasts to be reported as ratios. The default is to report contrasts as differences. {phang} {cmd:lnratio} causes contrasts to be reported as logarithms of ratios. The default is to report contrasts as differences. {phang} {cmd:stack}[{cmd:(}{help pshare##stackoptions:{it:options}}{cmd:)}] draws a stacked bar chart of the results. {it:options} are as described for {helpb pshare##stackoptions:pshare stack} below. {phang} {cmd:histogram}[{cmd:(}{help pshare##histoptions:{it:options}}{cmd:)}] draws a histogram of the results. {it:options} are as described for {helpb pshare##histoptions:pshare histogram} below. {phang} {it:display_options} are standard reporting options such as {cmd:cformat()}, {cmd:pformat()}, {cmd:sformat()}, or {cmd:coeflegend}. See {helpb estimation options:[R] estimation options}. {marker stackoptions} {title:Options for pshare stack} {dlgtab:Main} {phang} {opt vertical} and {opt horizontal} specify whether a vertical or a horizontal bar plot is drawn. The default is to draw a horizontal bar plot. {phang} {opt proportion} scales the population axis as proportion (0 to 1). The default is to scale the axis as percentage (0 to 100). {phang} {cmd:reverse} orders percentile groups from top to bottom (the richest are leftmost, the poorest are rightmost). The default is to order percentile groups from bottom to top (the poorest are leftmost, the richest are rightmost). {marker stackkeep}{...} {phang} {opt keep(list)} selects and orders the results to be included as separate bars. Use {cmd:keep()} with multiple outcome variables or subpopulations. In case of multiple outcome variables, {it:spec} is a list of the names of the outcome variables to be included. In case of {cmd:over()}, {it:list} is a list of the values of the subpopulations to be included. {it:list} may also contain {cmdab:t:otal} for the overall results (if overall results were requested). Furthermore, {it:list} may also contain elements such as {cmd:#1}, {cmd:#2}, {cmd:#3}, etc. to refer to the 1st, 2nd, 3rd, etc. outcome variable or subpopulation. See the {helpb pshare##keep:keep()} option of {cmd:pshare histogram} for examples. {marker sortopts}{...} {phang} {cmdab:s:ort}[{cmd:(}{it:options}{cmd:)}] orders the bars for the different outcome variables or subpopulation by the level of inequality. If {cmd:sort} is specified without argument, the bars are sorted in ascending order of the outcome shares of the top percentile group. The {it:options} for alternative sorting are: {cmdab:g:ini} sort by Gini coefficients {cmdab:d:escending} sort in descending order {cmdab:tl:ast} place total last {cmdab:tf:irst} place total first {phang} {cmd:gini(}{it:%fmt}{cmd:)} sets the format for the Gini coefficients included in the graph as secondary axis labels; see help {helpb format}. The default format is {cmd:%9.3g}. Gini coefficients will only be included if information on Gini coefficients is available in the provided results (i.e. if the {cmd:gini} option has been applied to {cmd:pshare estimate}). {phang} {cmd:nogini} suppresses the Gini coefficients. This is only relevant if the {cmd:gini} option has been specified when calling {cmd:pshare estimate}. {dlgtab:Labels/Rendering} {marker stacklabels}{...} {phang} {opt labels(labels)} specifies custom axis labels for the included outcome variables or subpopulations. The default is to use the variable labels of the outcome variables or the value labels of the subpopulations, respectively. {it:spec} is a list of labels that are applied one-by-one to the displayed bars (from top to bottom, or left to right). Use quotes if a label contains spaces, e.g. {cmd:labels("label one" "label two" ...)}. Type empty string to use the default label for a specific bar. For example, {cmd:labels("label 1" "" "label 3")} specifies custom labels for the first and third bars, and uses default labels for the other bars. {marker plabels}{...} {phang} {opt plabels(labels)} specifies custom labels for percentile groups in the legend. The default is to use labels composed of the values of the lower bound and the upper bound of the group. {it:spec} is a list of labels that are applied one-by-one to the groups (from left to right, or bottom to top). Use quotes if a label contains spaces, e.g. {cmd:labels("label one" "label two" ...)}. Type empty string to use the default label for a specific group. For example, {cmd:labels("label 1" "" "label 3")} specifies custom labels for the first and third groups, and uses default labels for the other groups. {phang} {opt barwidth(#)} sets the width of the bars as proportion of the spacing between bar positions. The default is {cmd:barwidth(0.75)}, leaving white space of 1/3 barwidth between the bars. {phang} {it:barlook_options} are options that affect the rendition of the plotted bars. See {helpb barlook_options:[G] {it:barlook_options}}. {phang} {opt p#(barlook_options)} affects the rendition of #th segment of the stacked bars (corresponding to the #th percentile group). {it:barlook_options} are as described in {helpb barlook_options:[G] {it:barlook_options}}. For example, to use khaki colored bars for the 3rd percentile group, type {cmd:p3(color(khaki))}, or, to print a thick red border around the bars of the 5th percentile group, type {cmd:p5(lcolor(red) lwidth(thick))}. If the {cmd:values()} option has been specified, {it:{help marker_label_options}} are allowed within {cmd:p#()}. {phang} {cmd:values}[{cmd:(}{it:%fmt}{cmd:)}] prints the values of the percentile shares as marker labels at the center of the bar segments and, optionally, set the display format for the values; see help {helpb format}. The default format is {cmd:%9.3g}. Use {it:{help marker_label_options}} to affect the rendition of the labels. To use differential rendition by bar segment, specify {it:marker_label_options} within the {cmd:p#()} options (see above). {phang} {it:marker_label_options} are options that affect the rendition of the values included as marker labels using the {cmd:values()} option. See {helpb marker_label_options:[G] {it:marker_label_options}}. Do not use {cmd:mlabel()} or {cmd:mlabvposition()}. {dlgtab:Add plots} {phang} {opt addplot(plot)} provides a way to add other plots to the generated graph. See {helpb addplot_option:[G] {it:addplot_option}}. {dlgtab:Y-Axis, X-Axis, Title, Caption, Legend, Overall} {phang} {it:twoway_options} are general twoway options, other than {cmd:by()}, as documented in {helpb twoway_options:[G] {it:twoway_options}}. {marker histoptions} {title:Options for pshare histogram} {dlgtab:Main} {phang} {opt vertical} and {opt horizontal} specify whether a vertical or a horizontal bar plot is drawn. The default is to draw a vertical bar plot. {phang} {opt proportion} scales the population axis in terms of proportions (0 to 1). The default is to scale the axis in terms of percentages (0 to 100). {marker keep}{...} {phang} {opt keep(list)} selects and orders the results to be included as subgraphs. Use {cmd:keep()} if {cmd:pshare estimate} has been applied to multiple outcome variables or subpopulations. In case of multiple outcome variables, {it:list} is a list of the names of the outcome variables to be included. Example: {com}. pshare estimate y1990 y2000 y2010 {com}. pshare graph, keep(y2010 y1990){txt} {pmore} In case of {cmd:over()}, {it:list} is a list of the values of the subpopulations to be included. {it:list} may also contain {cmdab:t:otal} for the overall results (if overall results were requested). Example: {com}. pshare estimate wage, over(race) total {com}. pshare graph, keep(total 1 2){txt} {pmore} Furthermore, {it:list} may also contain elements such as {cmd:#1}, {cmd:#2}, {cmd:#3}, etc. to refer to the 1st, 2nd, 3rd, etc. outcome variable or subpopulation. Example: {com}. pshare estimate wage, over(race) {com}. pshare graph, keep(#1 #3){txt} {marker maxmin}{...} {phang} {cmd:max(}{it:#}[, {it:options}]{cmd:)} top-codes results at {it:#} (i.e. truncates the bars from above); {cmd:min(}{it:#}[, {it:options}]{cmd:)} bottom-codes results at {it:#} (i.e. truncates the bars from below). This is useful if there are large differences in the plotted values and you want to restrict the axis range. The truncated values will be included in the graph as marker labels. {it:options} are: {cmdab:f:ormat(}{help format:{it:%fmt}}{cmd:)} set the format (default is {cmd:%9.3g}) {it:{help marker_label_options}} affect rendition of the labels {opt nolab:els} omit the marker labels {phang} {cmd:prange(}{it:min} {it:max}{cmd:)} restricts the range of the percentile groups to be included in the graph. Only results for percentile groups whose lower and upper cumulative population bounds (in percent) are within {it:min} and {it:max} will be plotted. {it:min} and {it:max} must be within [0,100]. For example, to include only the lower half of the distribution, type {cmd:prange(0 50)}. {phang} {opt gini(%fmt)} sets the format for the Gini coefficients included in the subgraph labels; see help {helpb format}. The default format is {cmd:%9.3g}. Gini coefficients will only be included if information on Gini coefficients is available in the provided results (i.e. if the {cmd:gini} option has been applied to {cmd:pshare estimate}). {phang} {cmd:nogini} suppresses the Gini coefficients. This is only relevant if the {cmd:gini} option has been specified when calling {cmd:pshare estimate}. {dlgtab:Labels/Rendering} {phang} {it:barlook_options} are options that affect the rendition of the plotted bars. See {helpb barlook_options:[G] {it:barlook_options}}. {phang} {cmd:step} causes a step function to be drawn instead of histogram bars. {phang} {cmd:addstep}[{cmd:(}{it:line_options}{cmd:)}] causes a step function to be drawn on top of the histogram bars. Specify {it:line_options} to affect the rendering of the step function line; see {helpb line_options:[G] {it:line_options}}. {phang} {cmd:spikes}[{cmd:(}{it:#}{cmd:)}] causes (equally spaced) spikes to be drawn instead of histogram bars. {it:#} specifies the number of spikes. The default is to draw 100 spikes, one for each percentile. Specifying {cmd:spikes} implies {cmd:noci} (see below). {marker labels}{...} {phang} {opt labels(labels)} specifies custom labels for the included subgraphs. The default is to use the variable labels of the outcome variables or the value labels of the subpopulations, respectively. {it:labels} is a list of labels that are applied one-by-one to the subgraphs. Use quotes if a label contains spaces, e.g. {cmd:labels("label one" "label two" ...)}. Type empty string to use the default label for a specific subgraph. For example, {cmd:labels("label 1" "" "label 3")} specifies custom labels for the first and third subgraphs, and uses default labels for the other subgraphs. {phang} {opt byopts(byopts)} determines how subgraphs are combined. {it:byopts} are as described in {helpb by_option:[G] {it:by_option}}. {phang} {cmd:overlay} causes results from the different outcome variables or subpopulations to be included in the same graph instead of using separate subgraphs. {cmd:overlay} and {cmd:psep()} are not both allowed. Specifying {cmd:overlay} implies {cmd:noci} (see below). {marker oopts}{...} {phang} {opt o#(options)} affects the rendition of the bars of the #th outcome variable or subpopulation if {cmd:overlay} has been specified. {it:options} are: {cmd:step} draw step function instead of bars {cmdab:adds:tep}[{cmd:(}{it:{help line_options:line_opts}}{cmd:)}] draw step function in addition to bars {it:{help barlook_options}} affect rendition of the plotted bars {marker psep}{...} {phang} {cmd:psep}[{cmd:(}{it:labels}{cmd:)}] causes different rendering to be used for each percentile group and includes a corresponding legend in the graph. The default is to draw all bars in the same style. {cmd:psep()} and {cmd:overlay} are not both allowed. {marker popts}{...} {phang} {opt p#(options)} affects the rendition of the bars of the #th percentile group if {cmd:psep()} has been specified. {it:options} are: {it:{help barlook_options}} affect rendition of the plotted bars {cmdab:ciopt:s(}{help pshare##ciopts:{it:options}}{cmd:)} affect rendition of the confidence spikes {dlgtab:Confidence intervals} {phang} {opt level(#)} specifies the confidence level, as a percentage, for confidence intervals. The default is the level that has been used for computing the {cmd:pshare} results. {cmd:level()} cannot be used together with {cmd:ci(bc)}, {cmd:ci(bca)}, or {cmd:ci(percentile)}. To change the level for these confidence intervals, you need to specify {cmd:level()} when computing the results. {marker citype}{...} {phang} {opt ci(citype)} chooses the type of CI to be plotted for results that have been computed using the bootstrap technique. {it:citype} may be: {cmdab:nor:mal}{col 25}normal-based CIs; the default {cmd:bc}{col 25}bias-corrected CIs {cmd:bca}{col 25}bias-corrected and accelerated CIs {cmdab:p:ercentile}{col 25}percentile CIs {pmore} {cmd:bca} is only available if BCa confidence intervals have been requested when running {cmd:pshare estimate}. {marker ciopts}{...} {phang} {opt ciopts(options)} specifies options that affect the rendition of the plotted confidence spikes, e.g. {it:{help line_options}}. The available set of options depends on plot type. The default plot type is capped spikes; see help {helpb graph twoway rcap}. Use the {cmd:recast()} option to change the plot type. For example, type {cmd:ciopts(recast(rspike))} for (uncapped) spikes; see {helpb graph twoway rspike}. Available plot types are range plots as listed in {helpb twoway}. {phang} {opt cibelow} causes the confidence interval spikes to be placed behind the plotted bars. The default is to draw the spikes in front of the bars. {phang} {opt noci} omits confidence interval spikes from the plot. {dlgtab:Add plots} {phang} {opt addplot(plot)} provides a way to add other plots to the generated graph. See {helpb addplot_option:[G] {it:addplot_option}}. {dlgtab:Y-Axis, X-Axis, Title, Caption, Legend, Overall} {phang} {it:twoway_options} are general twoway options, other than {cmd:by()}, as documented in {helpb twoway_options:[G] {it:twoway_options}}. {title:Examples} . {stata sysuse nlsw88} . {stata pshare estimate wage} . {stata pshare histogram} . {stata pshare estimate wage, percentiles(20 40 60 70 80 90 95 97 99) density} . {stata pshare histogram, yline(1)} . {stata pshare estimate wage, percentiles(20 40 60 70 80 90 95 97 99) density vce(bootstrap)} . {stata pshare histogram, yline(1)} . {stata pshare estimate wage, over(union)} . {stata pshare histogram, yline(1)} . {stata pshare estimate wage, over(union)} . {stata pshare contrast 0} . {stata pshare histogram, yline(0)} . {stata pshare estimate wage, over(industry) total gini} . {stata pshare stack, sort(gini tlast descending)} {pstd} For further examples see {help pshare##jann2016:Jann (2016)}. {title:Stored results} {pstd} {cmd:pshare estimate} stores the following in {cmd:e()}: {synoptset 20 tabbed}{...} {p2col 5 20 24 2: Scalars}{p_end} {synopt:{cmd:e(N)}}number of observations{p_end} {synopt:{cmd:e(N_over)}}number of subpopulations{p_end} {synopt:{cmd:e(N_clust)}}number of clusters{p_end} {synopt:{cmd:e(k_eq)}}number of equations in {cmd:e(b)}{p_end} {synopt:{cmd:e(bins)}}number of bins (percentile groups) per equation{p_end} {synopt:{cmd:e(df_r)}}sample degrees of freedom{p_end} {synopt:{cmd:e(rank)}}rank of {cmd:e(V)}{p_end} {synopt:{cmd:e(level)}}confidence level for CIs{p_end} {synoptset 20 tabbed}{...} {p2col 5 20 24 2: Macros}{p_end} {synopt:{cmd:e(cmd)}}{cmd:pshare}{p_end} {synopt:{cmd:e(cmdline)}}command as typed{p_end} {synopt:{cmd:e(depvar)}}name(s) of outcome variable(s){p_end} {synopt:{cmd:e(pvar)}}name of variable specified in {cmd:pvar()}{p_end} {synopt:{cmd:e(type)}}{cmd:proportion}, {cmd:percent}, {cmd:density}, {cmd:sum}, {cmd:average}, or {cmd:generalized}{p_end} {synopt:{cmd:e(norm)}}{it:#} or names of reference variables or empty{p_end} {synopt:{cmd:e(normpop)}}{cmd:total} or {it:overvar} {cmd:=} {it:#} or empty{p_end} {synopt:{cmd:e(percentiles)}}percentile thresholds{p_end} {synopt:{cmd:e(step)}}{cmd:step} or empty{p_end} {synopt:{cmd:e(gini)}}{cmd:gini} or empty{p_end} {synopt:{cmd:e(over)}}name of {cmd:over()} variable{p_end} {synopt:{cmd:e(over_namelist)}}values from {cmd:over()} variable{p_end} {synopt:{cmd:e(over_labels)}}labels from {cmd:over()} variable{p_end} {synopt:{cmd:e(total)}}{cmd:total} or empty{p_end} {synopt:{cmd:e(contrast)}}{cmd:contrast} or empty{p_end} {synopt:{cmd:e(baseval)}}{cmd:+} or value/name of base for contrasts{p_end} {synopt:{cmd:e(ratio)}}{cmd:ratio} or empty{p_end} {synopt:{cmd:e(lnratio)}}{cmd:lnratio} or empty{p_end} {synopt:{cmd:e(wtype)}}weight type{p_end} {synopt:{cmd:e(wexp)}}weight expression{p_end} {synopt:{cmd:e(clustvar)}}name of cluster variable{p_end} {synopt:{cmd:e(vce)}}{it:vcetype} specified in {cmd:vce()}{p_end} {synopt:{cmd:e(vcetype)}}title used to label Std. Err.{p_end} {synopt:{cmd:e(title)}}title in estimation output{p_end} {synopt:{cmd:e(properties)}}{cmd:b V} or {cmd:b}{p_end} {synoptset 20 tabbed}{...} {p2col 5 20 24 2: Matrices}{p_end} {synopt:{cmd:e(b)}}estimates (proportions, percent, densities, sums, or averages){p_end} {synopt:{cmd:e(V)}}variance-covariance matrix of estimates{p_end} {synopt:{cmd:e(_N)}}numbers of observations in subpopulations{p_end} {synopt:{cmd:e(G)}}Gini coefficients (if {cmd:gini} is specified){p_end} {synopt:{cmd:e(L_ul)}}upper bounds of Lorenz ordinates{p_end} {synopt:{cmd:e(L_ll)}}lower bounds of Lorenz ordinates{p_end} {synopt:{cmd:e(prop)}}population proportions{p_end} {synopt:{cmd:e(ul)}}upper bounds of cumulative population percentages{p_end} {synopt:{cmd:e(ll)}}lower bounds of cumulative population percentages{p_end} {synopt:{cmd:e(mid)}}midpoints of cumulative population percentages{p_end} {synoptset 20 tabbed}{...} {p2col 5 20 24 2: Functions}{p_end} {synopt:{cmd:e(sample)}}marks estimation sample{p_end} {p2colreset}{...} {pstd} If the {cmd:svy} option is specified, various additional results as described in help {helpb svy} are stored in {cmd:e()}. {title:References} {phang} Binder, D. A., M. S. Kovacevic (1995). Estimating Some Measures of Income Inequality from Survey Data: An Application of the Estimating Equations Approach. Survey Methodology 21(2): 137-145. {marker jann2016}{...} {phang} Jann, B. (2016). {browse "https://www.stata-journal.com/article.html?article=st0432":Assessing inequality using percentile shares}. The Stata Journal 16(2): 264–300. ({browse "http://ideas.repec.org/p/bss/wpaper/13.html":working paper}) {phang} Kovacevic, M. S., D. A. Binder (1997). Variance Estimation for Measures of Income Inequality and Polarization - The Estimating Equations Approach. Journal of Offcial Statistics 13(1): 41-58. {title:Author} {pstd} Ben Jann, University of Bern, jann@soz.unibe.ch {pstd} Thanks for citing this software as follows: {pmore} Jann, B. (2015). pshare: Stata module to compute and graph percentile shares. Available from {browse "http://ideas.repec.org/c/boc/bocode/s458036.html"}. {title:Also see} {psee} Online: help for {helpb pctile}, {helpb graph twoway bar} {psee} From the SSC Archive: {stata ssc describe lorenz:{bf:lorenz}}, {stata ssc describe sumdist:{bf:sumdist}}, {stata ssc describe svylorenz:{bf:svylorenz}}